The US government must not wriggle out of paying compensation to the victims of horrific
experiments in Guatemala in the 1940s.


The US Department of Justice is trying to persuade a court in
Washington DC to throw out a lawsuit filed last year by the
survivors of shameful 1940s experiments in Guatemala that
were funded by the US government. More than 1,300 Guatemalans,
almost all of them poor or otherwise vulnerable, were intentionally
exposed to the diseases syphilis, gonorrhoea or chancroid without
their consent. The experiments also took blood and cerebrospinal
fluid from 5,128 adults and children, again with no consent. Of those
infected, unknown numbers contracted the diseases and died; others
suffered for decades, in some cases infecting their spouses or their
children (see Nature 482, 148-152; 2012).

US President Barack Obama, Secretary of State Hillary Clinton and
Secretary of Health and Human Services Kathleen Sebelius were quick
to issue public apologies when news of the heinous experiments on
prisoners, prostitutes, orphans, soldiers and patients with leprosy or
mental illness broke in 2010. Obama also asked his bioethics commission
to investigate. It issued a damning, 200-page report last September.

But when lawyers for the victims asked US Attorney General Eric
Holder to set up a claims process that would meaningfully compensate
the survivors, they hit a wall of silence.

So in March 2011, they sued senior officials at agencies including the
National Institutes of Health in Bethesda, Maryland, and the Centers
for Disease Control and Prevention in Atlanta, Georgia, asking for
compensatory and punitive damages. These agencies and others
funded and oversaw the Guatemalan experiments in the name of
science 65 years ago.

In a court filing last week, the US government reiterated a stance
that it first took in January: that the court should throw the case out
because government officials are shielded from lawsuits that arise from
actions taken in the course of performing their jobs, and because current
officials cannot be held responsible for the acts of their predecessors
decades ago.

The United States may be in the right, legally. It can be argued that
no one would work for the government if they could be sued every
time their actions, never mind those of their predecessors, had a negative
impact on a member of the public. But the government is in the
wrong, morally, in refusing to find a way to compensate the survivors.
These people and their families have borne decades of pain, misery and
poverty on the heels of experiments called “reprehensible” in a joint
statement by Clinton and Sebelius, and for which Obama, in a personal
phone call with the Guatemalan president, expressed his “deep regret”.

These apologies will ring hollow until officials establish a compensation
fund that is outside the court system. (The courts could
take months or years to reach a conclusion in this case, and time is
of the essence: already, one of the elderly plaintiffs has died since the
lawsuit was filed.)

There is plenty of precedent for reparations for unethical medical
experimentation. The United States paid US$37,500 and lifetime
health benefits to each living survivor of the Tuskegee syphilis
experiments, which involved 399 poor African American tenant
farmers in Alabama. Those US-funded experiments left the
syphilis-infected farmers untreated between 1932 and 1972.

In 1996, the US government paid $4.8 million in compensation for 
injecting 12 people, most of them hospital patients being treated for 
unrelated illnesses, with plutonium and uranium in the mid-1940s. 
Those compensated included one survivor and 11 family members
of deceased victims. And earlier this year, a
task force in North Carolina recommended
that victims of that state’s forced-sterilization
programme, which ended in 1974, each
receive $50,000.

There is a compelling moral case that the
US government, a continuous institution
since the time of the horrific Guatemalan
experiments, owes reparations to the
Guatemalans affected. Senior officials clearly recognize this: the day 
after the Department of Justice first asked the court to throw out the 
case, the Department of Health and Human Services announced that 
it would spend $775,000 to prevent and treat sexually transmitted 
diseases in Guatemala. It pledged another $1 million to evaluate the 
effectiveness of changes being made to the way human subjects are 
protected in medical experiments. 

These measures are necessary, but not sufficient. The administra- 
tion must find a way to put money in the pockets of those who paid 
the highest price for these evil acts — and quickly. If it cannot, or will 
not, Congress should step into the breach. Justice demands no less.




No shame 


The handling of results suggesting faster-than-light
neutrinos was a model of fitting behaviour.


If the public learned one thing about physics last year, it was that
a particle had been found that might travel faster than the speed
of light. Most people were probably vague about what the particle
was, but they seemed to grasp the significance. The Universe's
speed limit was in doubt, and anything might be possible. The result,
announced by scientists at the OPERA neutrino experiment in Gran
Sasso, Italy, may have been wrong, but the message conveyed about
science was not. Late last month, following a vote of no-confidence in
their leadership, OPERA’s two top scientists resigned. Yet both men,
along with the rest of the collaboration, can hold their heads high.

The vote and the resignations have not been officially blamed on
the media circus over the faster-than-light neutrinos, which OPERA 
brought to public attention and was then forced to admit did not exist 
after all. But that is how they will be interpreted. And if the vote is 
portrayed as a referendum on how OPERA handled the situation, then 
scientists everywhere should think carefully about how they would 
have voted. What kind of science do they want? 

The neutrino story is familiar to most researchers now, but here 
are the highlights. OPERA was measuring a beam of neutrinos 
coming from CERN, Europe's high-energy physics lab near Geneva, 
Switzerland. Contrary to everything taught in modern physics, the 
neutrinos seemed to be arriving 60 nanoseconds faster than light 
speed. A small sub-team of researchers responsible for the measurement
spent months systematically checking OPERA’s detector and
could find no reason for the discrepancy.

When the smaller group shared their result with the full OPERA 
collaboration, it leaked to the Italian press. Faced with growing interest,
OPERA’s leaders, Antonio Ereditato and Dario Autiero (the duo who have
now resigned), decided to go public with a seminar.

Physicists saw plenty of reasons to doubt OPERA’s extraordinary
claim, and Ereditato and Autiero did not disagree. Even as they
presented the result, they invited their colleagues to comment, and 
encouraged others to try to reproduce their results. Within months, 
CERN had sent a new beam of neutrinos to Italy and a second experi- 
ment found neutrinos travelling at the expected speed. After a great 
deal of searching, members of the OPERA group eventually traced the 
discrepancy to a cable that was not fully screwed in. 

Scientists both inside OPERA and out have since fretted about 
what such a high-profile misstep might mean for funding, reputation 
and the public’s perception of science. In fact, OPERA’s handling of
the incident, at least publicly, was a model for how scientists should
behave. Ereditato and Autiero acted responsibly when speaking publicly
by sticking close to their data and avoiding over-interpretation.
They shared their work with their competitors, and did their best to
quickly address outside criticism. In the end, it was OPERA’s internal
checks that found the loose cable. When the error was discovered,
physicists on the team wasted no time in publicly announcing the
problem, along with others they had exposed during their review.

Broadly speaking, the media and the public seemed to grasp that 
this is the way science is supposed to work. Some Italian journalists 
aside, the press responsibly reported the initial result as simultaneously 
incredible and very possibly wrong. The public
enjoyed the opportunity to question the world
around them and learned a little physics in the
process. Media coverage generally sided with
the researchers for admitting they were wrong,
and no one has called for funding to be cut.

Science can fall victim to human frailties.
One researcher hoards her samples out of fear of
competition; another doggedly promotes his hypothesis long after the 
data have falsified it; negative results are hidden because of compet- 
ing financial interests. And the most frequent sin of all: questionable 
results go unchecked because it is in nobody’s interest to check them. 

The OPERA collaboration is not exempt from the human condi- 
tion. Some collaborators believe that publication was rushed out of a 
desire to beat the competition. But OPERA nevertheless conducted 
itself openly and properly. 

The no-confidence vote and resignations are a matter for the 
collaboration’s internal processes, and have no bearing on the 
quality of the collaboration’s science. But beyond OPERA itself, 
scientists should celebrate the way in which the results were dissemi- 
nated and the findings ultimately refuted. The process was open and 
deliberate, and it led to the correct scientific result. In an era in which 
politics, business and celebrity fixate on spin, control and staying
‘on message’, OPERA’s rise and fall make science stand apart. The
message here is that scientists are not afraid to question the big ideas. 
They are not afraid to open themselves to public scrutiny. And they 
should not be afraid to be wrong.




All together now 


The financial crisis brings Europe unexpected 
opportunities for international collaboration. 


Big pieces of scientific kit aren't getting any cheaper. Neutron
sources, synchrotrons, telescopes and particle accelerators all
require highly trained staff and lots of consumables. They need
to be upgraded periodically for research to march forward, and the
next generation must be even bigger and more ambitious.

In the current fiscal environment, just keeping the lights on can be
a struggle, but Europe’s facilities are finding a way. As we report on
page 295, they have begun inviting non-European countries to join
them. The decision is driven by short-term need, but the continent's
organizations may also be positioning themselves for an era of scientific
supremacy over rivals such as the United States and Japan.

Europe understands the need for cooperation. Multinational labs
bloomed throughout the second half of the twentieth century, a sign of
both Europe’s post-war unity and the fact that no single nation could
afford the types of facility that were being built elsewhere in the world.
Fast-forward 50 years, and bodies such as CERN, Europe's high-energy
physics lab near Geneva, Switzerland, and the European Southern
Observatory (ESO), based in Garching, Germany, are internationally
recognized. They have built instruments that are among the best in the
world, and they are training a generation of European scientists.

This year sees non-European members entering the fold for the first
time. Last month, Israel became the first state from outside Europe to
be represented on CERN’s governing council, and ESO will soon admit
Brazil. Other bodies, including some facilities now under construction, 
are partnering with countries such as India. Even national facilities are 
joining in: after years of cooperation, Italy’s Elettra synchrotron light 
source near Trieste has partnered with India to build two beamlines. 

This expansion attracts cash in the short term, but it also lays the 
groundwork for a more fruitful future. Organizations such as ESO and 
CERN will gain the best researchers from nations around the world, 
raising their status and the quality of their research. They will also 
strengthen the scientific base in their partner countries, increasing the 
chances that these nations will be politically willing and scientifically 
able to help with the next big project. In the United States and Japan, by 
contrast, large national facilities are run by powerful research agencies 
that are difficult to approach and poorly equipped to engage in major 
financial and material cooperation at home and abroad. They will lose 
out in future: the facilities of tomorrow will be so big and costly that 
only those who can cooperate will win. 

Two examples show the strength of Europe's hand. The first is ITER, 
a multinational fusion experiment that is being built in southern 
France. ITER’s ballooning budget is giving Europe headaches in the 
short term, but its long-term benefits to the continent’s scientific and
technical expertise are undeniable. The second project, the Square
Kilometre Array radio telescope, will not be located in Europe for
scientific reasons, but European scientists have a central role in choos- 
ing where it will be built, and will figure prominently in its operation. 

There are risks, of course: chiefly that Europe 
will cede some control of its own infrastruc- 
ture to its new partners. But done properly, the 
shared benefits can keep Europe on the top of 
the world for decades.


Deep-water drilling
remains a risky business


Donald Boesch argues that the lessons learned from the Deepwater Horizon
oil spill should be enshrined in legislation before they fade from memory.


Two years after the blowout of the BP oil well drilled by the Deepwater
Horizon rig in the Gulf of Mexico, the United States is
largely failing to act on the lessons learned from that experience
to ensure that deep-water drilling and production is safe and environmentally
compatible. In particular, the US Congress has not enacted
any legislation to improve safety and protect the environment. Meanwhile,
high oil prices are stimulating the expansion of drilling into ever
deeper waters in the Gulf of Mexico, as well as off Brazil, Africa and
Europe. Drilling is also proceeding in shallower, but ice-prone, regions
of the Arctic, including the Beaufort and Chukchi seas off Alaska.

I am one of two scientists who served on the US president’s commission
that produced the report Deep Water: The Gulf Oil Disaster
and the Future of Offshore Drilling; the other was Cherry Murray
of Harvard University. The commission concluded that the root causes
of the blowout and explosion were deficiencies in regulatory oversight
and multiple poor decisions made in the absence of a comprehensive
risk-management system. Other investigations into the disaster
essentially agreed.

We were impressed by the technologies developed to produce
hydrocarbons from ever deeper, more highly pressured formations, but
surprised by the lack of sophistication in techniques for detecting and
controlling risk, containing the flow of hydrocarbons, collecting
spilled oil and protecting vulnerable resources.
For example, ‘down-hole’ events (those taking place deep below the
seabed) are often inferred from indirect measurements of pressure
and volume rather than measured with state-of-the-art in situ
sensors of the type used in geophysical research and other industries.

Cement formulation, testing and placement (major factors in the
blowout) seem to be more of an art than a science. Cementing is also
central to debates on the increased recovery of hydrocarbons by
hydrofracturing, because it is critical both to limiting fugitive emissions of
methane and to preventing contamination of shallower aquifers.

The 2010 accident showed that no operating company in the world
had the capacity to rapidly contain a deep-water blowout. It took
months of seat-of-the-pants engineering to build and deploy a capping
stack that provided effective containment. Confusion reigned
over the fate of the oil and gas released 1,500 metres below the surface,
largely because of a lack of understanding of the operating environment,
including the direction and speed of water currents, and the
behaviour of hydrocarbons released at depth.

There have been some positive responses to
the Deepwater Horizon experience. The US
Department of the Interior (DOI) temporarily
suspended deep-water drilling in the Gulf until
new safeguards, including a demonstrated capacity to contain
blowouts, were put into place. The DOI also reformed its management
and oversight of offshore oil and gas development, separating safety 
regulation from developmental decision-making, and has established 
a new safety and environmental management system. 

The oil industry, in addition to developing the needed containment 
capacity, has improved its safety processes. The American Petroleum 
Institute, the industry body, has created a Center for Offshore Safety 
and named Charlie Williams, a seasoned and respected scientist from 
Royal Dutch Shell, to head it. We shall see whether this new centre 
can develop the planned third-party audit process and if the indus- 
try, working with the DOI, will advance cutting-edge research and 
development (R&D) of safety technology. 

BP has funded an independently man- 
aged Gulf Research Initiative to support 
longer-term research on the impacts of 
the Deepwater Horizon oil spill. Through 
this, and the natural resources damage 
assessment being conducted by state and 
federal agencies, we will learn much about 
the fate and the effects of the hydrocar- 
bons released during the blowout. And a 
task force established by President Barack 
Obama has developed the first phase of a 
Gulf of Mexico Regional Ecosystem Res- 
toration Strategy to address the long-term 
degradation of Gulf ecosystems, some of 
it due to the 60-year history of oil and gas 
development in the region. 

Unfortunately, the US Congress — caught 
up in partisan rancour, including debates about expanding offshore 
oil drilling — has failed to adopt legislation to address the lessons 
learned and the recommendations of the oil-spill commission and 
others. Such legislation should codify the executive reforms men- 
tioned earlier into law, increase liability limits, and dedicate sustained 
funding for oil-spill research and environmental assessment and 
monitoring. 

Even in the current constrained fiscal circumstances, improved 
oversight and essential R&D could be supported by industry fees 
amounting to pennies per barrel, imperceptible within the daily fluc- 
tuations in price on the world market or at the pump. 

New laws were passed within a year of the 1989 Exxon Valdez spill 
in Alaska. If important lessons are not to be lost as the events of 2010 
fade from memory, there is a pressing need to change the law to make 
such accidents less likely, and our response more effective.


Donald Boesch is president of the University of Maryland Center for 
Environmental Science in Cambridge. 
e-mail: boesch@umces.edu 
RESEARCH HIGHLIGHTS
Selections from the scientific literature


How fat spurs
inflammation


Obesity increases the risk of metabolic conditions such as insulin
resistance by triggering inflammation. Lan Wu and Luc Van Kaer at
Vanderbilt University in Nashville, Tennessee, and their colleagues
have established that a subset of immune cells called invariant
natural killer T (iNKT) cells links fat build-up with inflammation.

The authors fed mice a high-fat diet, which activated their iNKT
cells, triggering the release of inflammatory proteins. When the
animals’ iNKT cells were stimulated with an injected molecule for
eight weeks, the mice developed insulin resistance and fatty livers.
By contrast, mice that were engineered to lack iNKT cells did not
develop excess inflammation or metabolic conditions on a high-fat
diet, despite becoming obese.

Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1200498109 (2012)


Injectable protein
nanofactories


Tiny injectable particles containing protein ‘factories’ can be
remotely activated by ultraviolet light. The technique could
ultimately be used for drug delivery.

Daniel Anderson and his group at the Massachusetts Institute of
Technology in Cambridge created lipid spheres more than
100 nanometres in diameter that contained DNA and all the cellular
ingredients and machinery needed to make proteins. By tagging the
DNA with a chemical group that prevents it from being transcribed
into RNA but can be removed using ultraviolet light, the researchers
were able to control activation of RNA and protein production.

The authors developed nanoparticles that produce green fluorescent
protein (pictured) and luciferase, an enzyme often used for in vivo
molecular imaging. They showed that they could switch on luciferase
production remotely after injecting the nanoparticles into mice.

Nano Lett. http://dx.doi.org/10.1021/nl2036047 (2012)


ECOLOGY


Can coral cope with climate change?


Climate change is likely to alter the species
composition of coral reefs, rather than wipe out
entire reef ecosystems.

Terry Hughes at James Cook University in
Townsville, Australia, and his team sampled
132 sites along the full length of the Great Barrier
Reef (pictured), spanning 13° of latitude and a
range of sea surface temperatures. Of the 12 coral
taxa sampled, 11 showed significant differences
in abundance across the reef, regardless of how
susceptible they were to thermal stress and
bleaching. These differences in abundance did
not follow changes in latitude or temperature.
This flexibility may enable coral reefs to
continue functioning as the environment alters
with climate change.

Curr. Biol. http://dx.doi.org/10.1016/j.cub.2012.02.068 (2012)


Recipes for
making lung cells


The development of methods to transform embryonic stem cells into
lung tissue paves the way for models of lung diseases such as cystic
fibrosis.

Darrell Kotton at Boston University in Massachusetts and his team
used a series of proteins to turn mouse embryonic stem cells into
lung progenitor cells. When implanted into a mouse lung that had
been removed from the animal and stripped of its cells, the
progenitors formed structures similar to alveoli, sacs in the lungs
where gas exchange occurs. The cells also expressed proteins
characteristic of specialized lung cells.

Jayaraj Rajagopal at Massachusetts General Hospital in Boston and
his group took the work a step further, reprogramming skin cells
from a patient with cystic fibrosis into stem cells and then
converting these into lung progenitors. When implanted underneath
the skin of mice, the progenitors formed different kinds of
specialized lung cells, including those lining the airway that are
disturbed in cystic fibrosis.

Cell Stem Cell 10, 398-411;
385-397 (2012)


NEUROPSYCHOLOGY 


Poor attention 
linked to dyslexia 


Early problems with visual 
attention in children seem to 
be linked to later difficulties in 
learning to read, a hallmark of 
dyslexia. 

Andrea Facoetti at the 
University of Padua in Italy 
and his colleagues tested 
96 Italian-speaking children in 
kindergarten on their ability 
to pick out specific symbols 
from an array of others, as 
a measure of their visual- 
attention skills. Those who 
scored below average in visual 
attention were more likely to 
show below-average reading 
skills when they reached first 
and second grade. 

The results suggest that 
visual-attention deficits, rather 
than developmental delays in 
language skills, could predict 
dyslexia diagnoses in children, 
and could inspire strategies for 
early detection and treatment. 
Curr. Biol. http://dx.doi.org/10.1016/j.cub.2012.03.013 (2012)


Multiple malaria 
mechanisms 


The considerable variation 
in malaria symptoms seen 
between patients can be 
predicted from the parasite’s 
expression of certain genes 
and the response of the host's 
red blood cells to infection. 
Red blood cells infected 
by the malaria parasite 
Plasmodium falciparum often 
stick to uninfected ones in a
process called rosetting. Peter 
Bull and his co-workers at 
the Kenya Medical Research 
Institute-Wellcome Trust 
Research Programme in Kilifi 
studied parasites isolated 
from 131 infected children 
and found that patients with 
high levels of rosetting tended 
to have breathing difficulties. 


Children infected with 
parasites that did not exhibit 
high rosetting but expressed 
high levels of a particular 
subset of ‘var’ genes were more 
likely to exhibit impaired 
consciousness or coma. Var 
genes encode proteins that sit 
on the surface of infected red 
blood cells and specific subsets 
have previously been linked to 
severe malaria. 

The findings could help to 
guide treatment decisions or 
lead to new therapies. 

Sci. Transl. Med. 4, 129ra45 (2012) 


PHOTONICS 


Diamond sparkles 
with one photon 


A diamond diode can produce 
single photons at room 
temperature in response to a 
small electrical current. 
Devices that can produce 
one photon at a time have 
potential applications in 
quantum communication 
and computation, but usually 
require very low temperatures. 
Norikazu Mizuochi at Osaka 
University in Japan and his 
colleagues say that the key 
feature of their diode is a 
diamond layer that contains 
gaps in the carbon lattice 
occupied by nitrogen atoms. 
The researchers theorize 
that these nitrogen-vacancy 
complexes naturally capture 
positively charged ‘holes’ 
and electrons from a small 
electrical current that 
recombine and emit light. 
Nature Photon. http://dx.doi.org/10.1038/nphoton.2012.75 (2012)


NEUROSCIENCE


Fragile X fixed
in mice


A molecule that inhibits a
neurotransmitter receptor
reverses many of the
symptoms in a mouse model
of fragile X syndrome, the
most common form of
inherited mental retardation.
The disorder is caused by
mutations in the FMR1 gene
that result in overactivity of
the mGlu5 receptor.
Lothar Lindemann at
F. Hoffmann-La Roche in
Basel, Switzerland, Mark Bear
at the Massachusetts Institute
of Technology in Cambridge
and their colleagues treated
young adult mice engineered
to have fragile X syndrome
with a new long-acting mGlu5
inhibitor, CTEP, for up to
four months. The treatment
reversed a range of defects,
including hyperactivity,
impaired learning and
memory, and abnormal
neuronal shape. CTEP also
corrected the elevated rate of
protein synthesis in the brain’s
hippocampal region, which is
characteristic of the disease.
Several drug companies are
testing mGlu5 inhibitors in
clinical trials.
Neuron 74, 49-56 (2012)


Snail drives implanted fuel cell


A fuel cell implanted in a living snail
can generate electrical power for several
months.

Implanted fuel cells that are driven by
glucose generated by their host could one day power medical
devices in humans or environmental sensors in animals.
Evgeny Katz at Clarkson University in Potsdam, New York,
and his colleagues made the electrodes of their fuel cell out
of densely packed carbon nanotubes, and attached
glucose-oxidizing and oxygen-reducing enzymes to them. The
authors then implanted the electrodes into a snail (Neohelix
albolabris). After decreasing the rate of current extraction
to match the snail’s slow glucose transport and metabolism,
they got continuous electrical output for an hour. The fuel cell
remained functional in the snail for several months.

J. Am. Chem. Soc. 134, 5040-5043 (2012)


Predicting the 
Indian monsoon 


Climate models can produce 
more-accurate forecasts of 
rainfall during India’s summer 
monsoon season than can 
conventional statistical analyses 
of sea surface temperatures. 
Timothy DelSole and 
Jagadish Shukla at George 
Mason University in Fairfax, 
Virginia, found no significant
statistical relationship between
summer monsoon rainfall
in India and sea surface
temperatures surrounding
the country in May of each 
year from 1960 to 2005. 
They analysed predictions 
made by five coupled 
ocean-atmosphere climate 
models based on the same 
temperature data and period, 
and found that the models 
were better than the statistical 
methods at predicting rainfall. 
The authors say the models 
can more accurately forecast 
the evolution of sea surface 
temperatures throughout the 
summer, which influences the 
monsoon rainfall (pictured). 
Geophys. Res. Lett. http://dx.doi.org/10.1029/2012GL051279 (2012)
SEVEN DAYS


Controversy in class 


School teachers in Tennessee 
now have licence to critique 
mainstream scientific theories 
often attacked by religious and 
political conservatives, such as 
evolution and climate change. 
A bill that had passed both
state legislative houses with
an overwhelming majority
automatically became law on
10 April, after state governor
Bill Haslam (Republican)
neither signed nor vetoed
it. The law’s ramifications
remain unclear, but opponents
worry that it could result in
the teaching of religious or
non-scientific theories in
science classrooms. Tennessee
is the second US state to enact
a law of this kind; the first
was Louisiana in 2008. See
go.nature.com/iy5xcd
for more.


Antibiotic curb 
Spurred by long-running 
concerns about the rise of 
resistant strains of bacteria, 
the US Food and Drug 
Administration (FDA) 
announced on 11 April a
multi-pronged initiative to cut 
back on the use of antibiotics 
in farm animals. Guidance 
documents recommend that 
drug firms remove ‘growth’
and ‘production’ uses from
their product labels; that 
farmers use certain antibiotics 
only for therapeutic purposes; 
and that veterinary surgeons 
step in as watchdogs. Because 
these measures are voluntary 
for the moment, sceptics say 
that the initiative lacks the 
punch of robust regulation. 
See go.nature.com/nwn2nf 
for more. 


North Korean rocket fails


After much hype, North Korea’s latest space shot ended in
failure shortly after lift-off on 13 April. Seconds after launch,
the first stage of the Unha-3 rocket plunged into the Yellow
Sea, 165 kilometres west of Seoul. The United States military,
which tracked the launch, says that the rest of the rocket
seems to have quickly followed, along with the small weather
satellite that it was carrying. The failure is the nation’s third
since 1998. Unlike with previous attempts, the North Korean
government acknowledged the mishap in a brief statement to
the press.


Fracking go-ahead
Scientists appointed by
the UK government have
recommended that hydraulic
fracturing, or fracking (in which
fluid is pumped into rock to
release natural gas), should continue,
despite the risk of triggering 
earthquakes. On 17 April, 

the panel published a report 
recommending a preliminary 
injection stage with near-real- 
time monitoring of seismic 
activity. It advises halting 
fracking after an earthquake 
of magnitude 0.5 or greater. 
Last year, a report by Cuadrilla, 
an energy company based 

in Lichfield, UK, proposed a 
threshold of 1.7. Both reports 
were commissioned following 
earthquakes in April and May 
2011, of magnitude 2.3 and 
1.5 respectively, near a UK 
Cuadrilla fracking well. See 
go.nature.com/jgciyf for more.


Stem-cell safety
Regulations for the safe 
clinical use of stem cells 

and other “investigational 
agents” approved by the Texas 
Medical Board on 13 April 
have been criticized for being 
too permissive and opening 
the door to the sale of dubious 
treatments. They require those 
who wish to provide stem cells 
to patients to gain approval 
either from the US Food 

and Drug Administration, 
which requires most 
stem-cell treatments to be 
clinically verified, or from an 
institutional review board. 
The latter hurdle is likely 

to be lower. The rules were 
introduced by Governor Rick 
Perry, who had stem-cell
treatment for back pain last 
summer, and approved by a 
board of Perry appointees. 
See go.nature.com/owy9or 
for more. 


Emissions up 
Greenhouse-gas emissions in 
the United States rose by 3.2% 
in 2010, marking the end of a
two-year slide that coincided 
with the global economic crisis, 
according to an inventory 
released by the Environmental 
Protection Agency on 16 April. 
Cumulative emissions 

rose to the equivalent of 

6.8 billion tonnes of carbon 
dioxide, driven by fossil fuel 
consumption in the electricity, 
industrial and transport 
sectors. Despite the sharp 
increase from 2009, emissions 
remain more than 6% below 
their 2007 peak. Australia’s 
emissions, meanwhile, rose by 
0.6% in 2011, to the equivalent 
of 546 million tonnes of 
carbon dioxide, according to 
government figures released on 
17 April. 
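
The US percentages above can be turned into rough absolute figures. The sketch below is only a back-of-envelope check: it uses the rounded numbers quoted in the item (6.8 billion tonnes, a 3.2% rise, "more than 6%" below the 2007 peak), so the results are approximate, and the variable and function names are illustrative rather than taken from the Environmental Protection Agency inventory.

```python
# Back-of-envelope check of the US figures quoted above.
# All inputs are the rounded values from the text; names are illustrative.
us_2010 = 6.8        # billion tonnes CO2-equivalent in 2010
rise_2010 = 0.032    # 3.2% rise from 2009 to 2010
below_peak = 0.06    # 2010 is "more than 6%" below the 2007 peak


def implied_previous(current, fractional_rise):
    """Level before a fractional rise that produced `current`."""
    return current / (1.0 + fractional_rise)


def implied_peak(current, fraction_below):
    """Peak level if `current` sits `fraction_below` under it."""
    return current / (1.0 - fraction_below)


us_2009 = implied_previous(us_2010, rise_2010)
us_2007_min = implied_peak(us_2010, below_peak)

print(f"Implied 2009 emissions: about {us_2009:.2f} billion tonnes")
print(f"Implied 2007 peak: at least about {us_2007_min:.2f} billion tonnes")
```

On these rounded inputs, 2009 emissions come out at roughly 6.6 billion tonnes and the 2007 peak at a little over 7.2 billion tonnes.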


BUSINESS
Drug fines 


Improper marketing of 
antipsychotic drugs is 
attracting further fines. 

Last week, US health-care 
giant Johnson & Johnson, 
with its subsidiary Janssen 
Pharmaceuticals, was ordered 
to pay the state of Arkansas 
more than US$1.2 billion for 
playing down risks associated 
with the blockbuster 
antipsychotic Risperdal 
(risperidone), including 
increased probability of 
death in some trial groups. A 
string of states are suing the 
firm, which is headquartered 
in New Brunswick, New 
Jersey; fines have so far been 
ordered in South Carolina 
($327 million) and Louisiana 
($258 million). Johnson & 
Johnson has appealed both
decisions, but agreed to pay
$158 million to Texas in a
settlement. In 2009, Eli Lilly 
of Indianapolis, Indiana, 
agreed to pay $1.4 billion in 
fines related to its marketing 
of another antipsychotic drug, 
Zyprexa (olanzapine). 


Biologist sacked 


Protein crystallographer 
Robert Schwarzenbacher has 
been fired by the University of 
Salzburg in Austria following 
allegations that he fabricated 
the molecular structure of 
a version of a birch-pollen
allergen, published in the 
Journal of Immunology in 
2010 (N. Zaborsky et al. 

J. Immunol. 184, 725-735; 
2010). The article has not been 
retracted but the structure 
(code 3K78) has been 
removed from the Protein 
Data Bank. Schwarzenbacher 
admitted fabrication at first 
(N. Zaborsky et al. Acta Cryst. 
F68, 377; 2012), but then 
retracted his statement. In a
further statement to Austrian 
news agencies, he then said 
that the structure contains 
errors but they were not 
fraudulent. He is fighting his 
dismissal. 




Scientists arrested 


The arrest of two scientists in 
one week by police in Kolkata 
has triggered outrage in India. 
Ambikesh Mahapatra, a 


chemist at Kolkata’s Jadavpur 
University, was arrested on 
13 April after he circulated a 
political cartoon that made 
fun of Mamata Banerjee 
(pictured), the first woman 
to be chief minister of West 
Bengal. His arrest follows 
that of Partho Sarothi Ray, 

a molecular biologist at the 
Indian Institute of Science 
Education and Research 
Kolkata, who was arrested for 
protesting against the forcible 
eviction of slum dwellers in 
East Kolkata. See
go.nature.com/blo21v for more.


Astrophysicist death 
No criminal charges will be 
brought over the death of 
astrophysicist Steven Rawlings 
of the University of Oxford, 
UK. Rawlings was found 

dead at the home of his friend 
and fellow Oxford academic 
Devinder Sivia in January 

(see go.nature.com/kuiphg). 
Sivia was arrested and released
on bail, but the UK Crown
Prosecution Service now
says that he will not face any
charges. A coroner’s inquest
will seek to determine the
cause of death. Rawlings was
a key figure in the Square
Kilometre Array radio
telescope project.


TREND WATCH

An estimated 35.6 million people
were living with dementia in
2010, according to a World
Health Organization report
released on 11 April, which urged
governments to make plans for
coping with the economic and
social burden of brain illnesses.
As people live longer, cases of
dementia are projected to reach
115 million in 2050 (see chart).
That estimate assumes current
prevalence, which in people over
60 ranges from 8.5% in Latin
America to 2.1% in sub-Saharan
Africa.

RISING TIDE OF DEMENTIA
The number of people with dementia is expected to triple by 2050* as populations age.
[Chart: people with dementia (millions), 2010 to 2050, for low- and middle-income countries and high-income countries. Source: WHO]
*Extrapolations based on world dementia prevalence of 4.7% in those aged 60 and over in 2010.


Sumatra quake

Seismologists were surprised
at the power of an undersea
earthquake of magnitude
8.6 off the west coast of Aceh
on the Indonesian island of
Sumatra on 11 April. The
quake caused few fatalities,
and there was no major
tsunami because it was at a
‘strike-slip’ fault, in which
sections of crust rub together
horizontally, rather than
thrusting over or under each
other. It may be the largest
known such quake, apart
from one of magnitude 8.6
in northeast India and Tibet
in 1950. See
go.nature.com/nwcg7a for more.


Satellite silence
Officials with the European
Space Agency (ESA) say
that they have lost contact
with Envisat, the agency's
premier Earth-observing
satellite. Launched in 2002, the
satellite is billed as the most
sophisticated environmental
monitor in orbit, with ten
instruments providing streams
of valuable data on everything
from ozone, clouds and
greenhouse gases to land-use
trends and sea surface
temperatures. Its signal cut
out on 8 April, and although
the satellite remains in stable
orbit around Earth, ESA could
not establish contact as Nature
went to press. See
go.nature.com/9h5mom for more.


COMING UP

22-27 APRIL
At the European
Geosciences Union
General Assembly in
Vienna, researchers
discuss the role of
geoscientists in warning
of and mitigating
natural disasters.
www.egu2012.eu

26-27 APRIL
US science-policy
experts meet in
Washington DC to
discuss the nation’s
science and technology
prospects at the 37th
Annual American
Association for the
Advancement of Science
Forum on Science and
Technology Policy.
go.nature.com/eub4u7


China ethics 

More than 1,000 Chinese 
science journals administered 
by the China Association for 
Science and Technology in 
Beijing have jointly declared 
a code of ethics to prevent 
academic misconduct. State 
media reported on 10 April 
that publishers would punish 
editors who knowingly 
published plagiarized articles 
or abused their positions. The 
statement comes a month after 
China’s education ministry 
launched an effort to curb 
scientific misconduct at 
universities. 


NATURE.COM
For daily news updates see:
www.nature.com/news


The construction of the European Extremely Large Telescope (artist’s impression) in Chile will rely on funds from Brazil.


RESEARCH CONSORTIA 


European groups go global


International collaborations seek far-flung members to close budget gap. 


BY GEOFF BRUMFIEL 


The European Organization for Nuclear
Research (CERN) draws particle 
physicists from around the world to its 

headquarters near Geneva, Switzerland, but 
when they line up for lunch the meals are tra- 
ditionally French. Duck-leg confit and sliced 
beef liver are among the delicacies on this 
week's cafeteria menu. 

But falafel may soon become a staple. Last 
month, Israel took its seat on CERN’s council as 
an associate member — the first non-European 
country to do so in the organization's 50-year 
history. Other nations outside the European 
Union, including Serbia and Turkey, are 
expected to follow soon. For Rolf-Dieter Heuer, 
the lab’s director-general, the goal is to “redefine
the ‘E’ in CERN from ‘Europe’ to ‘everywhere’”.
CERN is not alone in its internationalist over- 
tures: facilities across the continent are hoisting 
the flags of new, non-European members. 

Part of the motivation for wooing new
partners is a genuine desire to broaden scientific
collaborations, but it is no coincidence that
the expansion coincides with the worst financial 
crisis in a generation. Many European nations 
are slashing their science spending, and inter- 
national labs are feeling the pressure to find 
savings and keep costs down¹. Greece, facing
a deepening crisis, has sought to reduce its 
fees to CERN and the European Space Agency 
(ESA), which currently total about €30 million 
(US$39 million). And states such as Portugal 
and Spain, which have made particularly deep 
cuts to their science budgets², are almost certain
to try to renegotiate their international obliga- 
tions. Meanwhile, fast-developing countries 
outside Europe have money to spend on science. 
Economics has always played a part in the
evolution of Europe’s multinational research
facilities. Many were set up as international
treaty organizations in the 1950s and 1960s
as part of an effort to strengthen economic,
social and industrial ties across Europe. They 
have gradually expanded their membership, 
but have traditionally drawn a line at the edge 
of the continent. Although large numbers of 
US and Japanese scientists work at CERN, 
for example, those nations are designated as 
observers on the lab’s governing council. 

That is changing fast. Last year, India joined 
the Institute Laue-Langevin (ILL) in Greno- 
ble, France, which produces beams of neutrons 
to probe the structures of materials and bio- 
chemicals. Together with Russia, India has also 
joined the €1.4-billion Facility for Antiproton 
and Ion Research (FAIR) now under construc- 
tion in Darmstadt, Germany. And later this 
year, Brazil is expected to ratify its membership 
in the European Southern Observatory (ESO), 
based in Garching, Germany³.

The financial downturn has certainly been a 
factor in the decision to bring India in as a full 
member of the ILL, says Andrew Harrison, the 
facility’s director. The ILL’s nuclear reactor 
delivers neutron beams to dozens of instruments
that have to be regularly upgraded to
keep the facility globally competitive. With 
each upgrade costing between €3 million and 
€6 million, staying on the cutting edge eats up 
a significant chunk of the facility’s €88-mil- 
lion annual operating budget. India will bring 
hardware and an extra €1 million or so per year 
in cash. 


CRUCIAL PARTNERS 

The desire to fund entirely new facilities 
provides an even stronger motivation for 
inviting outside partners. Beatrix Vierkorn- 
Rudolph, who oversees large research facilities 
at the German Federal Ministry of Education 
and Research in Bonn, says that without sup- 
port from Russia and India, FAIR could not 
have gone ahead. The facility’s Russian-born 
director, Boris Sharkov, adds that FAIR is in 
negotiations with China, Brazil and Saudi Ara- 
bia about possible collaborations, which could 
help to fund new particle detectors. 

Brazil's admission into ESO will be crucial in 
paying for that organization’s next-generation 
telescope, the 40-metre European Extremely 
Large Telescope (E-ELT). Around one-third of 
the €1.1-billion construction budget will come 
from Brazil’s membership dues, says Tim de 
Zeeuw, director-general of ESO. 

The new members get plenty in return, 
benefiting from the prestige associated with 
international collaborations and the training 
offered to their scientists. “ESO facilities are the 
top of the line,” says Sergio Machado Rezende, 
a physicist at the Federal University of Pernam- 
buco in Recife, Brazil, and the country’s science 
minister from 2005 to 2010. ESO’s best tele- 
scopes are located in nearby Chile, which made 
joining the organization even more attractive, 
he says. Brazil’s astronomy community is currently
made up of only about 200 researchers,
but Rezende says that involvement in ESO will
help the community to grow and gain strength. 
Brazil’s robust economy means that the nation 
will probably also consider involvement in 
organizations such as CERN, he says. 

India’s membership of FAIR should provide 
benefits for the country’s technology firms, says 
Subhasis Chattopadhyay, a scientific officer 
at the Variable Energy Cyclotron Centre in 
Kolkata. The companies will supply power con- 
verters and magnets for the facility, and so gain 
expertise in constructing these hi-tech instru- 
ments. “Our goal was to get Indian industry 
building many new things,” says Chattopadhyay,
adding that India, too, has its eye on CERN.

Not every organization is looking to expand 
beyond Europe. “For the time being, there are 
really zero discussions,” says Frederic Nordlund,
acting head of international relations at ESA 
in Paris. His agency works closely with many 
nations, including Russia and India, but most 
want to maintain domestic expertise in space
technologies, in part for strategic reasons.
ESA has a long-standing partnership
with Canada, but the agency’s current plans
for expansion involve only new European
Union member states.

The European Molecular Biology Labora- 
tory (EMBL) in Heidelberg, Germany, stands 
apart for different reasons. “EMBL isn’t an 
organization that grows by building mega- 
projects,” says Iain Mattaj, director-general of 
the organization. As a result, it has no plans 
to open itself formally to outside membership 
— although it did allow Australia to join as an 
associate member in 2008. 

Those that are expanding recognize that the 
move comes with risks. Some fear that a growth 
in international membership could erode the
quality of European science or make it more
difficult to govern. But Harrison says that Euro- 
pean nations already on the ILL council are 
careful to ensure that any new members have 
a strong scientific community ready to use the 
facility. “We would not be interested in a mem- 
ber country that is rich but did not have the kind 
of user base that we could work with,” he says. 


SUSHI SURPRISE 

Things could get complicated if Europe’s 
centralizing instincts were to lead it to consol- 
idate facilities — merging ESA and ESO, for 
example, or combining all of Europe's particle- 
physics facilities. Although such mergers are 
not likely in the short term, the presence of 
many non-European partners would hamper 
them in the future, says Carlo Rizzuto, chair- 
man of both the board of the Elettra Labo- 
ratory near Trieste, Italy, and the European 
Association of National Research Facilities. 
But ultimately, Rizzuto thinks, more interna- 
tional members will only strengthen Europe's 
facilities, and its science. “If you get the best 
players, then you develop the best facilities,” 
he says. 

Heuer hopes that the CERN council's mem- 
bership will grow from 20 to around 26 in the 
next 5-6 years. At the same time, he stresses 
that the lab must maintain its European 
character of careful deliberation and consen- 
sus-building, which is one reason that new 
partners must first join as associate members. 
Heuer predicts that CERN will adapt smoothly, 
given that it already hosts many scientists from 
outside Europe. As he points out, the lab’s cuisine
has already begun to diversify: “The cafeteria
also sells sushi, by the way.”


1. Brumfiel, G. Nature 466, 1028-1029 (2010). 

2. Catanzaro, M. Nature http://dx.doi.org/10.1038/nature.2012.10242 (2012).

3. Mann, A. Nature 469, 451-452 (2011). 


DISASTER PREPAREDNESS 


Tsunami simulations scare Japan 


Updated risk assessment warns country to prepare for much larger earthquakes and waves. 


BY DAVID CYRANOSKI IN TOKYO 


Japan’s government is heeding a key message
from last year’s Tohoku earthquake and tsunami:
the underwater faults that encircle the
country can unleash much greater devastation
than previously anticipated. Last week, the cabinet's
disaster-management division briefed local
officials on simulations that raise the spectre of
waves even larger and more destructive than
those last March, sending the officials scrambling
to rethink their tsunami defence plans.

The estimates come from a government-appointed
team of scientists led by Katsuyuki
Abe, a tsunami expert and emeritus professor of 
the University of Tokyo. The team updated the 
nation’s main tsunami model by increasing the 
magnitude of the largest expected earthquake 
to incorporate recent massive quakes, including 
the magnitude-9.0 Tohoku quake and the magnitude-8.8
quake that hit Chile in 2010, causing a tsunami
that reached as far as Japan. The result: for most 
locations, the size of future tsunamis could far 
outstrip previous estimates, which were made 
in 2003. The town of Kuroshio in Kōchi prefecture
was steeled for a maximum 14.1-metre
wave; it now faces the threat of a 34.4-metre 
inundation. Likewise, the offshore island of 
Niijima has to prepare for a 29.7-metre wave, 
rather than 5.4 metres (see ‘Making waves’). 
Ninety cities and towns must consider how 
to withstand a tsunami wave of higher than 
10 metres, whereas just ten locales were expecting
this from the older simulations. Twenty-three
have been told to prepare for a tsunami of
20 metres or more, a threat none had previously
anticipated. The team also raised the estimates
of the risks posed by the largest earthquakes,
with the number of towns and cities expecting
the maximum level of ground shaking (7 on
Japan's intensity scale) raised from 35 to 153.


MAKING WAVES

A reassessment of tsunami risks along Japan’s
southeastern coast suggests that many towns
and cities must prepare for much bigger
disasters than previously thought.

The reappraisal focuses on the Nankai 
trough, an offshore fault south of Honshu, 
Japan’s main island, that regularly produces 
large quakes, including the magnitude-8.7 
Hoei quake of 1707 that set the ceiling for the 
2003 estimates. The latest simulations use the 
Tohoku quake, which was roughly three times 
more powerful than Hoei, as the maximum. 
They also take account of recent studies that 
used the thickness of sediment layers depos- 
ited by past tsunamis to estimate the size and 
frequency of major earthquakes and tsunamis 
that happened before accurate measurement 
methods existed. The 2003 models, which pre- 
dicted much smaller tsunamis than those that 
followed the Tohoku earthquake, have been 
criticized for not including such sedimentation 
data (see Nature 483, 141-143; 2012). 
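
The "roughly three times" comparison can be checked with the standard seismological rule of thumb that radiated energy scales as 10 raised to 1.5 times the magnitude. The sketch below assumes that "more powerful" refers to energy released, which the article does not state explicitly; the function name is illustrative.

```python
# Energy ratio between two earthquakes from their magnitudes.
# Assumption: "more powerful" means radiated energy, which scales as
# 10 ** (1.5 * M) under the standard magnitude-energy relation.

def energy_ratio(m_large, m_small):
    """Return how many times more energy the larger quake released."""
    return 10 ** (1.5 * (m_large - m_small))


tohoku = 9.0   # magnitude of the 2011 Tohoku quake, from the text
hoei = 8.7     # magnitude of the 1707 Hoei quake, from the text

print(f"Tohoku vs Hoei: about {energy_ratio(tohoku, hoei):.1f} times the energy")
```

A difference of 0.3 magnitude units gives a factor of about 2.8, consistent with the article's "roughly three times".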

The updated model also considers ‘large 
slip’ areas, in which extensive crust movement 
can make parts of tsunamis particularly hefty. 
Such areas were to blame for the unexpect- 
edly high waves in parts of Japan last year. The 
team considered how coastal regions would be 
affected in 11 tsunami scenarios, each of which 
had large slip areas in different places along the 
Nankai trough. 

Magnitude-9 events are expected to occur 
very rarely, perhaps once every millennium, 
says Kenji Satake, a tsunami expert at the Uni- 
versity of Tokyo and a member of the team that 
updated the models. How much preparation 
is needed for such rare, devastating events
remains a matter of debate, but the models 
have put local governments in a tight spot. 
Yukihiko Nakamura, head of earthquake preparedness
in the Kōchi prefecture, says that
the briefing was not detailed enough to give 
administrators a clear path forward. 

“We need to know how high the water will 
come at different points. Will it be 34 metres 
everywhere?” The government has promised a 
fine-grained simulation and inundation maps 
that detail water levels by the end of May. 

Even that information may not help offi- 
cials to determine how to prepare for such 
devastating events. Some towns are installing 
additional loudspeakers to warn citizens of an 
approaching tsunami, but they might give no 
more than a few minutes’ warning. Nakamura 
says it is unreasonable to think that people 
could seek shelter — by climbing evacuation 
towers tens of metres tall, for example — 
within that time. 

Officials in some regions are also making 
plans to move city offices to higher ground, 
and considering measures such as building 
large underground shelters or relocating large 
numbers of homes — both of which are costly 
options that are beyond the capacity of local or 
prefectural governments. A cabinet working 
group is discussing how the central govern- 
ment could help. 

Before last year’s devastating earthquake, 
“we had been preparing for a magnitude-8 
quake they said would have a 60% chance of
coming in the next 30 years”, says Nakamura.
“But after Tohoku, we have to be prepared for 
the unexpected, too.”


Q&A

French scientists will have the chance to give their verdict on the root-and-branch reforms to the 
country’s science system on 22 April, when they vote in the first round of France’s presidential
elections. Polls predict a defeat for Nicolas Sarkozy, the incumbent president and candidate for the
right-wing UMP party. That result would cheer those researchers who argue that Sarkozy’s sweeping
reforms, often imposed with little consultation, have left some institutions sidelined and many labs 
lacking funds. Others, however, support many of his research reforms, including his move to give 
universities more autonomy, and fear that the programme could stall if he loses power. 

Sarkozy's main rival, the Socialist Party’s François Hollande, has consistently led in the polls
and is predicted to win the second and final round on 6 May by a large margin. If he does, he will
be the first socialist to take the Elysée Palace since François Mitterrand was re-elected in 1988.
Hollande says that he will maintain the thrust of many of the major reforms begun by Sarkozy, 
but will ensure greater consultation with the research community as he takes them forward. He 
also promises to create many more science jobs, in particular for young researchers. Although 
none of the other eight candidates in the first round is likely to win, horse-trading for the transfer 
of votes in the second round could influence the policies of the eventual victor. 

Nature asked the two leading candidates about their plans for science; their full responses, in 
French and English, are available at go.nature.com/mjvmu4, with extracts appearing here.


NICOLAS SARKOZY


What have been the main accomplishments of 
your presidency? 

Without hesitation, the 2007 law giving univer- 
sities autonomy from centralized state control. 
It has given universities greater freedom to set 
their own strategies, allows them to own their 
land and buildings, manage their budgets and 
borrow capital to make major investments. The 
universities can now recruit scientists without 
any ministry involvement — even though 
researchers are civil servants — and they 
have greater freedom to set salaries, through 
bonuses, instead of being tied to rigid national 
salary scales. Since 2007, university funding has 
also increased by 22%, and in 2008 we launched 
a €5-billion [US$6.5-billion] plan to refurbish a 
dozen select university campuses. 


Some scientists say that excellence initiatives 
are creating a two-tier system. How will you 
maintain research and teaching across France? 
Idex, the initiative for excellence funded by 
€7.7 billion from the €35-billion Investments 
for the Future programme, is exactly analogous 
to an initiative launched by Germany several 
years ago that has seen the emergence of nine 
institutions destined to become world-class 
universities. We have similarly selected eight. 


There is not that much disparity in opportunities
among the universities. They have other
opportunities beyond Idex. Ninety per cent of
French universities have received some form
of support from the Investments for the Future
programme. If we want world-class research
champions, we need to concentrate resources
on those places where the research is best.

I'm a big fan of cycling, and I love watching
the Tour de France. We've never seen the pack
accelerate because those at the rear go faster;
the pack accelerates when the leaders accelerate.
Similarly, I think the Investments for the
Future programme, because it rewards the best,
will lift the entire French system of research and
higher education, not just the winners.


What is the future for the large national research
agencies, such as the CNRS and INSERM?
France is a very special case. Just after the Second
World War, we created agencies separate from
the universities to do basic research. At the time,
such a set-up was found only in Communist
countries, in particular the USSR and China.
Now, even these countries have abandoned this
model. In the United States, the vast majority
of research operators are universities. That is
not yet the case in France, but it is our goal. The
research organizations are expected to become
more like real funding agencies, serving the
universities and research institutions.


Many leading scientists complain that lab
funding by the national research organizations
is falling. How would you rectify this situation?
In 2011, research spending as a percentage
of gross domestic product reached 2.3%,
close to its historical high [of 2.36%] in the
early 1990s. The budgets of the main public
research agencies (the CNRS, INSERM,
CEA, INRA) have risen every year since
2007. The National Research Agency’s
budget has stagnated, but the agency is also
responsible for allocating €22 billion for
higher education and research under the
Investments for the Future programme.
The best French research teams
have enjoyed an unprecedented increase in
funding, but these funds have not necessarily
passed through the main research agencies.


Scientists say that you have increased
bureaucracy in an already-complex research
landscape. How would you streamline this?

It is true that new structures have emerged.
Many need to be consolidated, and others need
to be simplified or reduced. The landscape of
French research and higher education is being
restructured. It is like a major construction site
where, as the project advances, the scaffolding
is taken down, the tarpaulin is taken off and
the new building appears. That is how it will
be for the universities and research in France.


Would you maintain nuclear power’s 75%
share of French electricity generation?

I want to intensify research to develop safer
nuclear generators, but also boost research and
development into alternative energy sources.
I want to manage the inevitable transition
towards alternative energies that will take place
in France. In the meantime, it would be stupid
and counterproductive to deprive France of its
valuable nuclear-energy assets.


Would you change France’s laws on human
embryo and embryonic-stem-cell research?
The issue was widely debated in parliament
during the revision of the bioethics laws in 2011.
The current situation, in which embryo research
is prohibited but there are dispensations for
work that could lead to major therapeutic progress
and where alternatives are not available, is
strictly equivalent from a scientific point of view
to what the Socialists propose. During the 2011
debate on bioethics, the French overwhelmingly
expressed their commitment to the sanctity of
life from its beginning. The national representation
therefore ruled that the existing system
should be maintained.


FRANÇOIS HOLLANDE


French universities have experienced huge
changes under President Sarkozy. Would you
continue or change his programme?

The 2007 law on university autonomy must
be profoundly reformed, but there will be no
going back on the principle of autonomy. However,
when responsibilities are transferred from
the state to the universities, there needs to be a
comparable transfer of resources.

I want the university-funding mechanisms
to be clarified, and for these not to increase
disparities, in particular by cutting back on
teaching or the salaries of lecturers and full-time
researchers. The reforms will be discussed
within a framework law on higher education
and research at the end of 2013, preceded this
year by a national consultation.


Many leading scientists complain that lab
funding by the national research organizations
is falling. How would you rectify this situation?
This year and last, the overall funding levels
for the national research organizations
and the National Research Agency fell. This
proves that the President’s words are just
smoke and mirrors. I will rebalance core lab
funding (the permanent support that lets
teams think in the medium and long term)
with project funding. I will refocus the
National Research Agency [the main source 
of project funding] on national priorities, 
emerging themes and interdisciplinary pro- 
jects. I will order a financial audit of the plans 
launched by the previous government. 

Other countries think French research 
is being inundated with money. But this 
new money doesn’t compensate for the cuts 
observed at the CNRS over the past two years, 
which have resulted in 25% less cash in the
labs. There is a chasm between the €22 billion 
announced for Investments for the Future and 
the reality, which is that the funding is only the 
interest on that endowment distributed over 
ten years — for every €1 billion announced, 
the reality is just €34 million per year. 
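
Hollande's ratio can be worked through as simple arithmetic. The sketch below takes his figure of €34 million per year for every €1 billion announced at face value and, as an assumption made only for illustration, applies it to the whole €22 billion; the interview does not say that every euro of the programme is held as an endowment, and the variable names are hypothetical.

```python
# Worked arithmetic for the endowment claim above.
# Inputs are the figures quoted in the interview. Applying the ratio to the
# full 22 billion euros is an illustrative assumption, not a stated fact.
announced_total = 22e9        # euros announced for Investments for the Future
annual_per_billion = 34e6     # euros per year for every 1 billion announced

implied_rate = annual_per_billion / 1e9          # implied yearly yield
annual_total = announced_total * implied_rate    # yearly cash on the full sum
ten_year_total = annual_total * 10               # paid out over ten years

print(f"Implied yield: {implied_rate:.1%} per year")
print(f"Yearly funding on the full sum: {annual_total / 1e6:.0f} million euros")
print(f"Ten-year funding: {ten_year_total / 1e9:.2f} billion euros")
```

On these assumptions the announced €22 billion would correspond to an implied yield of 3.4% and about €748 million per year, or roughly €7.5 billion over ten years.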


Some scientists say that excellence initiatives 
are creating a two-tier system. How will you 
maintain research quality across France? 

We need to restore the true sense of the term 
‘excellence’. The Investments for the Future
programme has worsened disparities between 
universities. French research is not just Paris, 
Bordeaux, Toulouse, Aix, Marseilles and Stras- 
bourg. I will not reconsider decisions that have 
been made but, after the audit, I will demand 
that unallocated credits be used to correct 
geographical disparities, to ensure that these 
do not develop into scientific and university 


deserts. I will push to replace competition 
between universities with cooperation. 


Would you prioritize research spending? 

I will make higher education and research a pri- 
ority, because the major problem facing France 
is to prepare for the future. I will make
youth the core of my programme, giving
priority to education from early childhood
through to university. I will favour basic
research, which has been sacrificed these past
few years, paying particular attention to the life 
sciences, where funding is far below the inter- 
national average, and environmental sciences. 


What is the future for the large national research 
agencies, such as the CNRS and INSERM? 

We shouldn't break what works. France is lucky 
to have its large national research organizations 
alongside the universities. These organizations 
must recover their role of steering the national 
research effort, and of supporting research car- 
ried out in the universities. 


What will be your policies on nuclear and other
energy technologies?
I want France to accomplish an energy 
transition with the share of nuclear energy
being reduced from 75% to 50% of electric- 
ity production within 15 years. I will create 
major research programmes on ‘disruptive’ 
technologies such as electricity storage, the 
greening of various production sectors and 
energy efficiency. 


Would you change France’s laws on human 
embryo and embryonic-stem-cell research? 

I will authorize research on embryonic stem 
cells to clarify the situation of the 2011 law 
on bioethics, which maintained a ban on this 
research while allowing dispensations for par- 
ticular projects. It is important to put an end 
to this hypocrisy. 


How do you assess President Sarkozy’s impact 
on research and universities? 

The President has introduced multiple reforms 
without providing the necessary resources. In 
the past decade, we have fallen from 4th to 
15th place worldwide in terms of spending 
on research as a percentage of GDP — 2.26% 
compared with 2.82% in Germany and 2.9% 
in the United States. France is trailing, and the 
trust between researchers and political leaders 
has been broken. My first objective will be to 
restore that trust.

Japan gambles on displays


Consortium of three big brands gains government backing to spur mobile technologies. 


BY KATHERINE BOURZAC 


With ever more beautiful and
addictive smartphones and tablets
captivating consumers, the market
for mobile display screens is booming. Yet
Japanese display makers Hitachi, Sony and 
Toshiba, once industry giants, have languished. 
Now the three companies have merged their 
mobile-display subsidiaries to form Japan Dis- 
play Incorporated, launched on 1 April. With 
¥150 billion (US$1.85 billion) in research and 
development funding over the next five years, 
the Tokyo-based endeavour aims to launch 
technologies that will help its members to 
recapture the display market. 

“We expect this company will be driven by 
research and development,” says Hiroyuki 
Oshima, Japan Display’s deputy chief technol- 
ogy officer. Many of the features that sell tablets 
and smartphones are based on display innovations,
such as the super-high-resolution screens
of Apple’s iconic iPad and iPhone devices. Now
the race is on to lengthen battery life, make 
gadgets thinner and lighter, and provide rich 
colours that can be seen from wide angles for 
watching videos with a friend. “The display tech- 
nologies that have the slickest stuff command 
the biggest market,” says Colaneri, head of the […].

Japan Display’s bid
is mainly financed by
a government-backed 
investment fund, the 
Innovation Network Corporation of Japan, 
which has contributed ¥200 billion and holds 
a 70% share in the company. A particular area 
of focus, Oshima says, will be organic light- 
emitting diodes (OLEDs), which the com- 
pany will begin producing by the end of next 
year. OLEDs, which use polymers or smaller 
carbon- and metal-based molecules, provide 
richer colour than conventional liquid-crystal
displays (LCDs), and use less power.

[Figure: A display of unity. Toshiba, Sony and Hitachi hope to reverse a steady decline in their share of the mobile display-screen market by combining forces to form ‘Japan Display’.]

Stephen Forrest, vice-president of research at 
the University of Michigan in Ann Arbor, notes 
that OLEDs are a speciality of South Korea’s 
Samsung Mobile Display, the world leader with 
23.3% of the mobile-display market. Yet, he 
adds, there is still room for innovation. “Some 
things about OLED technology are not settled,”
he says. For example, researchers are still figur- 
ing out the most cost-effective design for the 
electronics that drives the displays. Low-cost 
‘amorphous-silicon’ transistors are adequate
for conventional LCD screens, whose pixels 
change their light-transmitting properties 
depending on an applied voltage. By contrast, 
OLED pixels are driven by changes in current, 
and amorphous-silicon transistors cannot pro- 
vide enough. One solution is to subject amor- 
phous silicon to a laser treatment that upgrades 
it to better-quality polysilicon. But this is an 
expensive process. 

The leading alternative to polysilicon for 
OLEDs is a new class of transistor material, 
based on metal oxides. These materials 
promise good performance at lower manu- 
facturing costs than polysilicon. Before the 
merger, the three companies were all developing
metal oxides, and Oshima says that he is
now trying to get the research teams to work
together. However, Japan Display may have 
some catching up to do: on 13 April, Sharp, 
another Japanese maker, announced that it 
will become the first to bring metal oxides into 
production. 

In other areas, Oshima says, the research 
strengths of the three merged companies are 
more complementary. For example, Sony 
has developed a technology for embedding 
touch-screen sensors, typically a separate ele- 
ment sold by another company, into the display 
itself. If it can be made to work, this ‘embedded’
touch should make phones thinner and lighter, 
and reduce costs. 

Analysts at DisplaySearch, a market research 
company headquartered in Santa Clara, Cali- 
fornia, predict that Japan Display will become 
a big player, starting with a projected 13.8% 
market share in its first quarter (see ‘A display 
of unity’). Prior to their joint effort, the three 
companies didn’t invest enough resources in 
innovation in the mobile-display market, says 
Oshima. Now, Japan Display will put all its 
research efforts there. Colaneri thinks that the 
more focused approach could help the com- 
pany to grow further. “Aggressive investment 
in research and development could make a 
difference,” he says.

Exome sequencing could help to identify the causes of intellectual disability in children such as Siebe.


MEDICAL GENETICS 


Gene hunt is on for 
mental disability 


Pioneering clinical genome-sequencing projects focus on
patients with developmental delay. 


BY EWEN CALLAWAY 


Medical geneticists are giving genome
sequencing its first big test in the
clinic by applying it to some of their
most baffling cases. By the end of this year, 
hundreds of children with unexplained forms 
of intellectual disability and developmental 
delay will have had their genomes decoded as 
part of the first large-scale, national clinical 
sequencing projects. 

These programmes, which were discussed 
last month at a rare-diseases conference hosted 
by the Wellcome Trust Sanger Institute near 
Cambridge, UK, aim to provide a genetic 
diagnosis that could end years of uncertainty 
about a child’s disability. In the longer term, 
they could provide crucial data that will under- 
pin efforts to develop therapies. The projects 
are also highlighting the logistical and ethical 
challenges of bringing genome sequencing to 
the consulting room. “The overarching theme 
is that genome-based diagnosis is now hitting
mainstream medicine,” says Han Brunner, a 
medical geneticist at the Radboud University 
Nijmegen Medical Centre in the Netherlands, 
who leads one of the projects. 

About 2% of children experience some form 
of intellectual disability. Many have disorders 
such as Down's syndrome and fragile X syn- 
drome, which are linked to known genetic 
abnormalities and so are easily diagnosed. 
Others have experienced environmental risk 
factors, such as fetal alcohol exposure, that rule 
out a simple genetic explanation. However, a 
large proportion of intellectual disability cases 
are thought to be the work of single, as-yet- 
unidentified mutations. 

Scientists estimate that about 1,000 genes 
are involved in the function of the healthy 
brain. “There are so many genes that can go 
wrong and give you intellectual disability,” says
André Reis, a medical geneticist at Erlangen 
University Hospital in Germany. Reis’s group, 
the German Mental Retardation Network, has
already sequenced the exomes — the 1-2% 
of the genome that contains instructions for 
building proteins — of about 50 patients with 
severe intellectual disability. 

Joining the hunt is a UK-based programme 
called Deciphering Developmental Disorders, 
which expects to sequence 1,000 exomes by the 
year’s end, with an ultimate goal of diagnos- 
ing up to 12,000 British children with devel- 
opmental delay. A Canadian project called 
FORGE (finding of rare disease genes) aims 
to sequence children and families with 200 dif- 
ferent disorders this year. And in the United 
States, the National Human Genome Research 
Institute in Bethesda, Maryland, recently 
funded three Mendelian Disorders Sequencing 
Centers that will apply genome sequencing to 
diagnosing thousands of patients with a wider 
range of rare diseases, including intellectual 
disability and developmental delay. 


FIRST GLANCE 

Early results are coming in from Brunner’s 
team, which has already sequenced about 
100 exomes of children with intellectual dis- 
ability. By comparing the children’s exomes 
with those of the parents, the researchers have 
identified new mutations — potential causes of 
the disorder — in as many as 40% of the cases. 
The other programmes are having similar suc- 
cess at making possible genetic diagnoses. 

In most cases, identifying mutations will 
not point to medical treatments, let alone 
cures. But scientists say the importance of a 
diagnosis should not be discounted. “Parents 
have been struggling with the delay of their 
children for years. They have gone from one 
doctor to the next, had all kinds of tests done 
on their children looking for an explanation,”
Reis says. Knowing that the mutation causing a 
child's intellectual disability is new rather than 
inherited can also reassure parents that other 
children they conceive are unlikely to have the 
same disease. 

Treatments could eventually follow. 
The projects are guiding research in mice, 
zebrafish and fruitflies, with the goal of 
unpicking the mechanisms of mental disor- 
ders. But it will undoubtedly be a long time 
before any potential therapies are tried in 
humans: an early-stage clinical trial of a drug 
to treat fragile X syndrome, for example, was 
published last year (S. Jacquemont et al. Sci. 
Transl. Med. 3, 64ra1; 2011), some two dec- 
ades after the gene underlying the condition, 
FMR1, was identified.

The work is also throwing up a fresh chal- 
lenge: how can scientists be sure that a specific 
mutation is the cause of a particular form of 

mental disability? “It’s not clear what is the
threshold of evidence at which you can say
this is the causal variant in this patient,” says
Daniel MacArthur, a geneticist at Massachusetts
General Hospital in Boston. In a recent
Science paper, his team estimated that the aver- 
age healthy genome contains about 100 gene- 
disabling mutations. Such ‘background noise’ 
could lead scientists astray in their hunt for 
causal mutations. 

Brunner says that about half of the muta- 
tions his team has identified have previously 
been seen in other patients with similar forms 
of intellectual disability, offering enough assur- 
ance to make a diagnosis. Circumstantial evi- 
dence, such as indications that the mutation 
disrupts a gene expressed in the brains of 
animals, ties the other half of the mutations 
to intellectual disability. But making a solid 
case often requires identifying second, third 
and fourth patients with similar mutations and 
symptoms. 

Scientists are already forging these con- 
nections on an informal basis. At the Sanger 
Institute meeting, several groups reported 
mutations in a gene called ARID1B in patients
with intellectual disability. James Lupski, a 
medical geneticist at Baylor College of Medi- 
cine in Houston, Texas, says that when his 
team identifies a potentially disease-causing 
mutation in a patient genome, he e-mails other 
scientists to see whether they have found simi- 
lar mutations. 

But researchers agree that they need a more
formalized way to make these connections. To
that end, the US National Center for Biotech- 
nology Information in Bethesda is developing 
a database, ClinVar, to integrate clinical and 
genetic data; others, such as DECIPHER, run 
by the Sanger Institute, handle genetic data 
such as chromosome rearrangements that can 
disrupt genes. 

The first clinical sequencing projects are also 
grappling with what they should or shouldn't 
tell patients. “We don’t want people coming 
into our clinic for intellectual disability and 
coming out with a cancer gene; this is not
what they came for,” says Reis.

Brunner’s team once had to face just
that situation. The researchers identified
a mutation in a gene in one patient that
could increase the risk of colon cancer as an 
adult. The project's ethical review board had 
determined that if families wanted to know 
of mutations potentially underlying a child’s 
intellectual disability, they must also be willing 
to receive such incidental findings, and so the 
child’s parents were told. But clinical sequenc- 
ing projects vary in their approach to inciden- 
tal results. For the time being, Deciphering 
Developmental Disorders will not inform
families about such findings. For FORGE
Canada, the policy varies from province to 
province. 

A working group convened by the American 
College of Medical Genetics and Genomics 
in Bethesda recently suggested drawing 
up a list of gene mutations that ought to be 
routinely reported back to patients. The list 
would include mutations strongly linked to 
conditions for which a medical intervention 
is available. 

“This is a fast-changing ethical environment,”
says Matt Hurles, a geneticist at the
Sanger Institute and one of the leaders of Deci- 
phering Developmental Disorders. His team 
is conducting a web-based survey to gauge the 
attitudes of parents, physicians and the general 
public towards disclosing incidental genomic 
findings. Lupski admits, “We're learning as we 
go along. People don’t want to hear that, but 
that’s the truth of the matter.” 

Scientists and clinicians hope that the les- 
sons learned in these initial large-scale clinical 
sequencing projects will inform genomic med- 
icine as it reaches more patients and moves to 
other specialities, such as neurology and cardiology,
and even to routine health care. “If in
five years’ time this project hasn’t catalysed the
adoption of genomic technologies which have
been shown to be useful, in some degree we
will have failed,” says Hurles.

Neuroscience shows
that the adolescent 
brain is still developing. 
The question is whether 
that should influence the 
sentencing of juveniles. 


ILLUSTRATION BY ANDY POTTS 


BY LIZZIE BUCHEN 
Teenagers can do terrible things. In
1999, Kuntrell Jackson, then 14, was
walking with his cousin and a friend 
in Blytheville, Arkansas, when they decided to 
rob a local video store. On the way there, his 
friend, Derrick Shields, revealed that he was 
carrying a sawn-off shotgun in his coat sleeve. 
During the robbery, Shields shot a shop worker 
in the face, killing her. 

Four years later, 14-year-old Evan Miller and 
an older friend were getting drunk and stoned 
with a middle-aged neighbour in a trailer park 
in Moulton, Alabama. A fight broke out, and 
Miller and the friend beat the neighbour with 
a baseball bat. Then they set fire to his home 
and ran, leaving him to die. 

Both Miller and Jackson were found guilty of 
homicide and sentenced to life without parole, 
meaning that both will spend the rest of their 
lives in prison. They are not alone. The United 
States currently has more than 2,500 indi- 
viduals serving such sentences for crimes they 
committed as juveniles — that is, before their 
eighteenth birthdays. It is the only country that 
officially punishes juveniles in this way. Both 
Miller and Jackson appealed, arguing that their 
immaturity at the time of the crime rendered 
them less culpable for their actions than adults, 
and that they deserved a less severe punish- 
ment. The Supreme Court heard arguments 
in Miller v. Alabama and Jackson v. Hobbs in 
March, and is expected to deliver its ruling by 
this summer. 

The cases are notable not only because they 
could abolish life-without-parole sentences 
for juveniles, but also because neuroscience 
research may play a part in the decision. Sev- 
eral organizations submitted briefs to the court 
detailing the growing body of research show- 
ing that the brain’s development continues into 
at least the early twenties. This, they argue, may 
explain why young people are more impulsive 
than adults, more readily swayed by their peers 
and less likely to consider the consequences of 
their actions. 


UNDER THE INFLUENCE 

Advocates for juveniles have been embracing 
this work as part of a long-term strategy to 
ensure that young criminals are given less pun- 
ishment than adults and more opportunities 
for rehabilitation. And many neuroscientists 
studying the adolescent brain are gratified 
that their work is contributing to these efforts. 
“It’s so satisfying to think that maybe in some
minuscule way my work was relevant to soci- 
ety,” says Bea Luna, who studies adolescent 
brain development at the University of Pitts- 
burgh in Pennsylvania. 

But the brain research may not have as great 
an influence in court as some scientists and 
advocates like to think. Some say that the neu- 
roscience offers no fresh insight into adolescent 
behaviour, and may serve merely as a rhetorical 
flourish in judges’ opinions or as a tool that law- 
yers and advocates exploit to make their case. 


“The neuroscience is being used for an advo- 
cacy position,” says Emily Murphy of Stanford 
University in California, who was a fellow with 
the MacArthur Foundation’s Law and Neuroscience
Project. “That's all it’s always been, in a
legal context.” Murphy and others worry that 
the neuroscience currently being used in court 
may be abused, and might overshadow other 
research that could make a deeper impact on 
juvenile crime and punishment. 

Historically, courts in the United States have 
treated juveniles with leniency. But in the late 
1980s and early 1990s, a rise in violent youth 
crime, including several high-profile school 
shootings, provoked a backlash of tough
justice. Across the country, young offenders
were increasingly transferred from the juvenile 
courts, which emphasize rehabilitation, to adult 
criminal courts, which focus on retribution and 
punishment, including the death penalty. 
Advocates for juveniles have fought to 
reverse this trend, and have done so, in part, by 
pointing at major changes now known to take 
place in the adolescent brain. During matura- 
tion, the brain undergoes large-scale struc- 
tural changes. Fatty tissue called myelin wraps 
around neurons, speeding up signal trans- 
mission in brain cells, particularly between 
brain regions. At the same time, superfluous 
connections are pruned away in a process that 
may decrease noise in the system and allow 
remaining neurons to function more efficiently. 
These processes were once assumed to be 
completed during childhood, but techniques 
developed over the past two decades have shat- 
tered that idea. Structural magnetic resonance 
imaging (MRI) shows that pruning goes on 
into early adulthood. Diffusion tensor imaging,
another form of MRI, shows much the
same for myelination. And functional MRI 
(fMRI), which reveals activity in the brain in 
near real time, has shown how this ongoing 
development affects activity levels in different 
parts of the brain. Some of the most marked 
changes during these later developmental 
periods occur in brain systems associated 
with impulse control, resisting immediate
rewards and emotional processing, all
behaviours relevant to criminal activity. 
Systems involved in processing reward 
also seem to mature more quickly than those 
involved in decision-making and impulse 
control. One fMRI study showed that when
adolescents were presented with rewards for 
performing a simple task, the nucleus accumbens
(part of the brain's reward system) lit
up in a pattern similar to that seen in adults in 
the study, but much more strongly. Meanwhile, 
the sparse activity seen in the prefrontal cor- 
tex — a region thought to be involved in deci- 
sion-making and impulse control — looked 
more akin to that of children. The authors of 
the study say that this may help to explain the 
heightened risk-taking behaviour common 
in adolescents. A teenager, as advocates and 
some scientists like to say, is like a car with a
great accelerator but terrible brakes. As Larry 
Steinberg, a psychologist who studies teen 
behaviour at Temple University in Philadel- 
phia, Pennsylvania, has said, “With powerful 
impulses under poor control, the likely result 
is a crash.”


CRIME AND PUNISHMENT 

The Supreme Court got its first taste of this 
research in 2005, when it considered Roper v. 
Simmons. In 1994, Christopher Simmons was 
sentenced to death in Missouri after he and an 
accomplice wrapped a woman's head in duct 
tape, bound her limbs with electrical wire, and 
threw her off a railway bridge. He was 17 at the 
time of the crime. When the case reached the 
Supreme Court, the American Medical Asso- 
ciation (AMA) and American Psychological 
Association (APA) filed briefs in Simmons’s 
support explaining the current state of research 
on the immaturity of adolescent brains. 

The court found it unconstitutional to 
impose the death penalty for crimes com- 
mitted by juveniles; Simmons is now serving 
life without parole. Although the neurosci- 
ence evidence was never directly cited, Jus- 
tice Anthony Kennedy wrote in the majority 
opinion that “as any parent knows and as the 
scientific and sociological studies ... tend to 
confirm, ‘[a] lack of maturity and an under- 
developed sense of responsibility are found in 
youth more often than in adults’”.

Many scientists and advocates for juveniles 
considered the result a triumph for neurosci- 
ence in the court, likening it to Brown v. Board 
of Education of Topeka, the landmark 1954 
ruling that ended racial segregation in public 
schools. Brown v. Board was believed to be 
heavily influenced by a sociological study of 
children judging dolls of different colours. This 
purportedly showed that segregation had a 
negative impact on black students’ self-esteem. 
Just as that case is thought to mark the mod- 
ern era of the court’s use of scientific research, 
Roper v. Simmons was believed to herald the 
era of neuroscience in the court. 

Emboldened by the Roper decision, advo- 
cates and attorneys have increasingly called 
on neuroscience research when arguing that 
juveniles should be spared the harshest pun- 
ishments. In 2010, the court ruled in Graham v. 
Florida that sentencing a juvenile to life with- 
out parole for a crime other than homicide 
was “cruel and unusual”. The neuroscience 
research, emphasized in briefs from the AMA,
the APA and Graham's lawyers, seemed to 
strike even more of a chord with the justices, 
who wrote in the majority opinion that “devel- 
opments in psychology and brain science 
continue to show fundamental differences 
between juvenile and adult minds”. 


BRAIN OVERCLAIM 

But some researchers are uncomfortable 
with the way this research is being used in 
the criminal justice system. There is certainly 
a correlation between brain development 
and behavioural maturity, but saying that 
observed differences in the adolescent brain 
cause certain adolescent behaviours is a symp- 
tom of what lawyer and psychologist Stephen 
J. Morse at the University of Pennsylvania in 
Philadelphia calls “brain overclaim syndrome”.
Neuroscientists, says Morse, are “always using
language that suggests causation, when they
don’t know causation”.

Moreover, the behaviours studied in the lab 
bear little resemblance to criminal behaviour. 
Scientists cannot perform brain imaging on 
someone in the process of committing a crime. 
Instead, they study volunteers who are chal- 
lenged to resist impulses or follow rules. In 
some experiments looking to assess impulse 
control, for example, volunteers are told not to 
look at a flashing light. Not looking at the light, 
says Luna, who has used the task for more than 
ten years, “is extremely difficult to do. We 
know exactly what brain circuitry gets engaged 
to stop that reflexive response. Is it the same 
as murdering someone? No.” But Luna argues 
that such experiments are relevant. Although 
adolescents can stop themselves from looking 
at the light, doing so is associated with much 
more activation of their prefrontal cortex than 
is seen in adults, suggesting that it is more difficult
for them to control their impulses.

But that wasn’t really in question. Decades’
worth of behavioural research had already 
established that teenagers take greater risks, 
are more driven by emotions and peer influ- 
ence, and generally behave less responsibly 
than adults. Still, advocates say, neuroscience 
bolsters the arguments. “When you come up 
with a biological mechanism, it gives it a lot 
more credibility,” says Luna. To many audiences,
neuroscience evidence tends to be
more convincing than behavioural science, 
particularly when aided by pictures of brain 
areas lighting up. 

“Lawyers and judges have grown up thinking 
that social science is soft,” says constitutional
law scholar David Faigman at the University 
of California Hastings College of the Law in 
San Francisco. “Neuroscience gives the courts 
a hook.” But, Morse […]
only relevant to the law when it connects with
behaviour. “If the psychology is soft, the neuroscience
is soft,” Morse says.

Hard or soft, the data are open to interpreta- 
tion. And scientists have little control over when 
or how their research is used as a persuasive 
tool. Steinberg’s research was cited in the court's 
opinion in Roper v. Simmons and in briefs in 
support of juveniles for Graham v. Florida and 
the two current cases — but it has also been 
used by advocates seeking to restrict the rights
of adolescents, in particular the right to have an
abortion without parental permission. 
Ultimately, the neuroscience data can only 
take legal arguments so far. Juveniles may be 
less responsible than adults, but they may still 
be responsible enough to deserve the same pun- 
ishment for similar crimes, says Morse. “That's 
not a scientific question. It’s a moral question, 
and, ultimately, a legal question.” Because of 
this incompatibility, Faigman says that science 
rarely, if ever, drives a court decision. Judges 
may use science for rhetorical leverage when 
writing the opinion, but “it’s not always clear 
whether the research made an impact”. 


HOT HEADS OR COLD BLOODED 

Take Roper v. Simmons. Briefs to the court 
supporting Simmons presented scientific evi- 
dence that the immature brains of adolescents 
like him render them more likely to act in the 
heat of the moment, giving little thought to the 
future. But Simmons didn't fit that descrip- 
tion: according to Supreme Court docu- 
ments, he told his friends that he “wanted to 
murder someone” days before the murder, “by 
breaking and entering, tying up a victim, and 
throwing the victim off a bridge”. 

Although the court alluded to scientific 
research in its opinion, that research may not 
have influenced the decision. “The justices 
may have [already] come to the conclusion that 
the death penalty was fundamentally flawed, at 
least in its application to juveniles,” says Faig- 
man. “They needed to support that with good 
reasons.” 

If science’s part in Roper v. Simmons was 
merely as a makeweight, then comparing
it to Brown v. Board might not be far off the 
mark. In that case, Faigman says, the court had 
probably already decided to end segregation in 
public schools on moral and social grounds. 
Although “the perception was that science lay 
at the core of the decision”, says Faigman, the
doll study had been “severely criticized” before 
the ruling. It was only mentioned in a footnote. 

When the court decides on Miller v. Ala- 
bama and Jackson v. Hobbs, advocates for 
juveniles hope to see life without parole taken 
off the table for all juveniles. But the court may 
draw the line at 14, the age of the petitioners. 
Whatever line the justices do draw, if they draw 
one at all, it will probably be based on legal and 
moral ideas rather than scientific evidence. 

So does neuroscience deserve a place at 
the table? Perhaps, says Terry Maroney of 
Vanderbilt University Law School in Nashville, 
Tennessee, but it shouldn't outshine other evi- 
dence. Solid research by social scientists shows, 
for example, the high potential of teen crimi- 
nals to reform, the inability of life sentences 
to deter juvenile crime, and racial inequalities 
in life-without-parole sentencing. Focusing
on biology draws attention away from socio- 
economic, educational and cultural drivers of 
criminal behaviour. After all, most teenagers 
do not commit murder, despite their fledgling 
frontal cortices. 

Evan Miller grew up with physical and 
emotional abuse so severe that he attempted 
suicide several times, starting at the age of 
five. He was using drugs and alcohol regularly 
before he was a teenager; the man he killed 
was his mother’s drug dealer. Kuntrell Jackson 
was raised in the lap of violence. Bryan Ste- 
venson, his attorney, told the Supreme Court 
justices that “his grandmother shot his uncle. 
His mother shot a neighbour. His brother shot 
someone. They were all put to jail.”

Neuroscience supports what centuries of 
casual observation have strongly suggested: 
teenagers tend to be more impulsive and irre- 
sponsible than adults. That probably can’t be 
changed, but the environments that they grow 
up in can be addressed with social policy. 
“That’s harder,” says Maroney. “It requires
more work and sacrifice. It’s easier to look and
say, ‘Ooh, look what’s going on inside their
heads! It’s all their problem.’”


Lizzie Buchen is a freelance reporter in San 
Francisco and volunteers with prisoners at San 
Quentin State Prison.

Two scientists work on the near detector of the MINOS neutrino experiment at Fermilab in Illinois.

“It’s like Christmas shopping at the specialist boutiques,” says Phil Adamson, as he describes his recent
US$250,000 buying spree. Adamson, a physicist at the
Fermi National Accelerator Laboratory (Fermilab) in Bata- 
via, Illinois, leads a team that has spent the past few months 
acquiring and installing three ultra-high-precision atomic 
clocks; six Global Positioning System receivers; more than a 
kilometre of optic fibre; two auxiliary detectors and at least one 
pair of timing-interval counters (“kind of fancy stopclocks”,
he says) — all to time subatomic neutrinos with nanosecond 
precision as they pass through the detectors of Fermilab’s Main 
Injector Neutrino Oscillation Search (MINOS). 

The researchers are trying to answer one simple question: 
do neutrinos travel at or below the speed of light, as required 
by the theory of relativity, one of the most fundamental 
tenets of modern physics? Or do they travel just a tiny fraction 
faster, as suggested with enormous fanfare last September by 
an experiment in Italy? 

To outsiders, this debate has already been settled. Research- 
ers on the Italian experiment, based at Gran Sasso National 
Laboratory near L’Aquila, announced in March that they had
found the error in their measurement — and two of the team’s 
leaders resigned (see go.nature.com/xjzhqa). At the same time, 
physicists working at a different detector at Gran Sasso have 
published measurements showing that neutrinos do indeed
obey the light-speed limit. 

Yet neutrino speed is still a prime focus for the MINOS 
physicists, who are carrying out their own high-precision 
measurement — not least because they did not do this when 
they saw hints of a faster-than-light neutrino in their own data 
some years ago. “This is so much on our front burner,” says
Robert Plunkett at Fermilab, a spokesman for MINOS. Around 
30 members of the 150-strong collaboration are now working 
on the search. 


Precision is what the MINOS team is all about. In 2008, Fer-
milab’s Tevatron was supplanted as the world’s highest-energy 
particle accelerator by the Large Hadron Collider (LHC) at 
CERN, Europe's particle-physics laboratory outside Geneva, 
Switzerland. Since then, US particle physicists have moved 
from studying collisions at the highest energies to working 
with beams at the highest intensities, adapting the country’s 
existing accelerator facilities to measure the rates of extremely 
rare interactions. The hope is that, even at lower energies, 
forcing theory and experiment into close comparison can turn 
up anomalies that point to new physics. 

Neutrino physics is the centrepiece of that programme. The 
particles respond only to the aptly named weak force, and 
mostly stream through solid matter as if there were no bar- 
rier. But this aloofness also makes neutrinos potentially a very 
clean probe of exotic forces: when a neutrino does hit another 
particle, physicists don't have to disentangle the effects of the 
much larger strong and electromagnetic forces. Experiments 
using accelerators, in which physicists can control the energy 
and direction of the neutrino beam, take maximum advantage 
of that fact. And MINOS is the most sophisticated experiment 
of this kind in the United States. 

True, it takes a special kind of researcher to pursue such 
work — pushing measurements towards the last possible deci- 
mal point, over decades if need be. But most of the MINOS 
physicists share the attitude voiced by team member Nathaniel 
Tagg, a physicist at Otterbein College in Westerville, Ohio. “I'm 
one for long shots.” 


COWBOY PHYSICS 

Neutrinos’ central role in US physics research is a case of 
historical turnabout, says Tagg. For years the field was con- 
sidered ‘cowboy physics’ — a fringe area best left to diehard 
experimentalists who were willing to drag their detectors down 
remote mineshafts and into underground bunkers, where the 
devices could escape the confounding effects of atmospheric 
cosmic rays. 

The archetypal cowboy was Raymond Davis, a radiochemist 
at Brookhaven National Laboratory in New York, who wanted to 
make the first direct observations of nuclear fusion reactions in 
the core of the Sun. He began in 1967 by setting up a 380-cubic- 
metre tank of perchloroethylene, a dry-cleaning fluid, nearly 
1,500 metres down in the Homestake gold mine in Lead, South 
Dakota. Davis’s idea was that neutrinos created in the Sun’s 
nuclear reactions would very occasionally strike a chlorine atom 
in the tank and turn it into a radioactive isotope of argon, which
he could extract and detect by chemical means. But only about 
one-third of the expected number of neutrinos seemed to be 
showing up. Either the theorists were wrong about the rate of 
fusion reactions in the core of the Sun — which didn't seem 
likely, given the success of their calculations in other areas of 
astrophysics — or two-thirds of the neutrinos were getting lost. 

It took decades for physicists to reach a consensus that this 
‘solar neutrino problem’ was real, not some obscure experi-
mental error, and only in the 1990s did they converge on a
probable solution. This started from the hypothesis that neu-
trinos come in three types, or ‘flavours’, each of which is the
electrically neutral partner of a negatively charged particle 
with mass — an electron, a muon or a tau. The Sun’s fusion 
reactions produce only electron neutrinos. But as soon as they 
are produced, according to the theory, they begin to oscillate among the
three flavours, so that by the time they reach Earth
just one-third will be electron neutrinos, the only kind that
Davis's tank could detect. 

This oscillation theory elegantly accounted for the missing 
solar neutrinos, but posed a new conundrum. The standard 
model of particle physics held that all three flavours of neutrino 
have a mass of exactly zero, like that of the photon. But oscil- 
lation was possible only if neutrinos have at least a very small 
mass — which meant that the particles were hinting at some 
kind of physics beyond the standard model. 

This prospect quickly took neutrinos out of the cowboy- 
physics category (Davis himself would share the Nobel Prize in 
Physics in 2002). MINOS was one result (see ‘Particle switch’).
Planning began in the 1990s as physicists looked for a way 
to verify the oscillation model. In particular, they wanted to 
create an artificial neutrino beam that would remove one of 
the biggest uncertainties in solar neutrino experiments, which 
was the number of neutrinos arriving at the detector. The 
plan called for Fermilab’s accelerator complex to fire protons 
at a graphite target, producing a spray of short-lived charged 
particles that would decay into muon neutrinos. The result- 
ing neutrino beam would pass through a 1,000-tonne, train- 
carriage-sized ‘near’ detector in the Fermilab grounds, where 
enough neutrinos would be captured to gauge the total number 
in the beam. The remaining particles would continue under- 
ground on a 735-kilometre, straight-line path to the Soudan 
Mine in northern Minnesota, where a five-times-larger ‘far’ 
detector would measure how many muon neutrinos arrived. 
If some had oscillated into other forms, the discrepancy would
be obvious — and neutrino oscillation would be verified. 


COLLISION COURSE 

In 1998, however, while MINOS was still on the drawing 
board, the US project was beaten to this goal by the Super- 
Kamiokande experiment near Hida in Japan. Through a 
clever experimental design, the Japanese physicists had been 
able to verify the existence of oscillation using neutrinos
generated by cosmic rays striking the atmosphere.

That disappointment took a toll, admits Adamson, one of 
many particle physicists drawn to the field by a desire to sort 
out the oscillation phenomenon. But it also proved an oppor- 
tunity. Rather than being the ones to discover new physics, the 
MINOS team decided that they would be the ones to carefully 
characterize the phenomena that the Super-Kamiokande had 
found. “By the time we started to operate [in 2005] we were 
trying to make precision measurements,” says Adamson.

One obvious question was why neutrinos have mass at all. 
The standard model can explain the mass of charged particles, 
such as the electron, as a subtle interaction with the hypotheti- 
cal Higgs boson. But that mechanism was not supposed to 
affect neutrinos. The three neutrino varieties have masses so 
tiny, less than one-millionth that of the electron, that some 
kind of exotic mass-generating mechanism may be at work, 
and high-accuracy measurements of the oscillation phenom- 
enon could shed light on what that is. 

Such questions soon began to move the long-term plan- 
ning for accelerator neutrino facilities to the forefront of US 
high-energy physics. In 2006, the Particle Physics Project Pri- 
oritization Panel of the Department of Energy (DOE) laid out 
a roadmap for the field that included continued support for 
MINOS and endorsement of a new experiment known as the 
NuMI Off-Axis Neutrino Appearance (NOvA). (NuMI, which
stands for ‘neutrinos at the main injector’, is the name of the
neutrino beam serving the experiment.) 


ACCELERATED DEVELOPMENT 

The plans for NOvA called for a boost in the energy of
Fermilab’s neutrino beam and a new detector farther north 
in Minnesota. One of its key goals would be to measure how 
oscillation occurs among neutrinos’ three antimatter coun- 
terparts, the antineutrinos, and find out whether the process 
obeys ‘charge-parity’ symmetry. This symmetry, which basi- 
cally means that interactions should remain unchanged if 
particles and antiparticles swap places and everything is 
viewed in a mirror, is known to be violated in only a few, very 
rare reactions. Nevertheless, charge-parity violation is thought 
to be the ultimate explanation for the emergence of much more 
matter than antimatter from the early Universe, and why stars, 
planets and living things can exist today. If the symmetry can 
be violated for neutrinos and antineutrinos, then these ghostly 
particles could provide unique insights into the processes that 
have made the Universe the way it is. 

At MINOS, meanwhile, the team was busy boosting the 
intensity of Fermilab’s neutrino beam, setting a world record 
for precisely determining the difference between the masses of 
the three types of neutrino, and measuring a variety of param-
eters crucial for the design of NOvA.

Then, in June 2010, scientists at MINOS reported early 
signs of a discrepancy in the rates at which neutrinos and anti- 
neutrinos oscillated*. This particular discrepancy would have 
violated another fundamental symmetry of quantum field 
theory known as CPT, for charge, parity and time. In its way, 
this would have been just as astonishing as faster-than-light 
neutrinos. MINOS scientists began to hope for a paradigm- 
shattering discovery. But these hopes were dashed when further 
data, reported in February this year, suggested that the result
was a statistical fluctuation. “It was disappointing,” says Justin
Evans, a physicist at University College London who is a mem- 
ber of the MINOS team. “We made the world’s most precise 
measurement of antineutrinos’ parameters — but everyone 
wants to be the group that discovers something new.” 

The faster-than-light neutrino announcement last Sep-
tember from the Oscillation Project with Emulsion-tracking
Apparatus (OPERA) experiment in Gran Sasso shook the
MINOS team out of its data-collecting routine. The Italian lab 
reported that neutrinos seemed to be making the 730-kilo- 
metre trip from CERN to Gran Sasso some 60 nanoseconds 
faster than a light beam would. The announcement galled 
some MINOS collaborators, who back in 2001 had proposed 
an ultraprecise measurement of neutrino speed only to have 
the idea nixed by the DOE. The team decided not to resurrect 
the idea in 2007, when its low-precision measurements hinted 
that neutrinos might be travelling faster than light. It didn’t
seem worth fighting that battle again to follow up a result with 
minimal statistical significance. 
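
To put the OPERA figures in proportion, the arithmetic can be sketched as follows. This is an illustrative calculation only, not the collaboration's analysis: it takes the 730-kilometre baseline and the 60-nanosecond early arrival quoted above at face value, and the function names are hypothetical.

```python
# Speed of light in vacuum in metres per second (exact by definition).
C = 299_792_458.0


def light_travel_time(distance_m: float) -> float:
    """Time in seconds for light to cover distance_m in vacuum."""
    return distance_m / C


def fractional_speed_excess(distance_m: float, early_s: float) -> float:
    """Return (v - c) / c for a particle that arrives early_s before light.

    The particle's speed is v = d / (t_light - early_s), so
    v / c - 1 = early_s / (t_light - early_s).
    """
    t_light = light_travel_time(distance_m)
    return early_s / (t_light - early_s)


# Figures quoted in the article (rounded values, used as assumptions here).
baseline_m = 730e3   # CERN to Gran Sasso
early_s = 60e-9      # reported early arrival

t_light = light_travel_time(baseline_m)
excess = fractional_speed_excess(baseline_m, early_s)

print(f"light travel time: {t_light * 1e3:.3f} ms")
print(f"fractional speed excess: {excess:.2e}")
```

On these rounded inputs the light travel time is a little over 2.4 milliseconds, so a 60-nanosecond shift corresponds to a speed excess of only a few parts in 100,000, which is why nanosecond-level timing matters.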

So, stung by the hullabaloo from Italy — and aware that 
MINOS, given the similarity of its set-up to OPERA, was 
uniquely placed to provide the all-important independent 
replication of the remarkable finding — the MINOS team was 
determined to get the measurement right. 



But MINOS was scheduled for a temporary shutdown in 
March 2012, during which the energy of its neutrino beam 
would be boosted to serve NOvA. MINOS would also receive 
some upgrades so that it could look for exotic phenomena 
such as sterile neutrinos — hypothetical particles that would 
not participate in any interaction governed by the standard 
model — and neutrino oscillation into extra dimensions. Yet 
everyone understood how momentous it would be if neutrinos 
really did violate Einstein's speed limit, says Adamson. Once 
the team realized that it had the people and the set-up to check 
the result in a reasonable time, he says, it felt it had a duty to 
weigh in. 

Fermilab approved a two-month delay in the neutrino beam 
shutdown, and the MINOS team resurrected and improved its 
decade-old proposal to do the measurement at high precision. 
By the time of the shutdown, now scheduled for 1 May, the 
experiment should have yielded data sufficient to pinpoint the 
time-of-flight measurement to within 11 nanoseconds, similar 
to OPERA’s uncertainty. And by sometime in 2013, after the
neutrino beam is upgraded to higher energy, the increase in 
event rate should allow MINOS to reach an error of between 
2 and 7 nanoseconds.

Physicists at MINOS are determined to make the measure-
ments, even if their chance of a faster-than-light finding is now
exceedingly remote given OPERA’s admitted error. “There’s a
chance it will turn out to be really interesting,” says Tagg. But
if, as looks likely, the measurement simply confirms some- 
thing that everybody already knows, MINOS will at least have 
produced a high-precision measurement of a fundamental 
parameter, something that physicists on the experiment insist 
is a noble contribution, even if it’s not a discovery. “It’s a pleasure
and privilege to be in a position to settle this,” says Plunkett.


Eugenie Samuel Reich is a contributing correspondent for
Nature.

This eruption of plasma from the Sun in June 2011, captured by instruments on the STEREO spacecraft, didn’t cause a space weather storm on Earth. Others will.


Prepare for the coming space weather storm


We need to improve estimates of geomagnetic storm size, says Mike Hapgood, 
so we can be ready for huge disruptions to electrical systems. 


On 13-14 March 1989, Earth experienced its largest geomag-
netic storm in decades: a barrage of
hot, ionized particles streamed towards the 
planet, wreaking havoc on its magnetic field 
and electrical systems. The storm caused a 
power blackout in Quebec, Canada, leav- 
ing 5 million people without electricity 
for 9 hours in cold weather and causing 
Can$2 billion (US$2 billion) in damages 
and losses to business. It also permanently 
wiped out a US$12-million transformer in 
the United States, and caused two large UK 
transformers to be sent back to the factory 
for repairs. Space agencies temporarily lost 
track of some 1,600 spacecraft. 

We should be prepared for much worse. 
Even bigger storms have been recorded in 
the past, although society at the time was 
less reliant on electrical systems and thus less 
vulnerable. In September 1859, a huge geo- 
magnetic event disrupted the then-emerging 
technology of the electric telegraph, shocking
operators and sparking fires in telegraph
offices. A similar event in May 1921 burnt 
down a telephone exchange at Karlstad in 
Sweden. A storm on these scales today would 
prove disastrous to our highly electronic soci- 
ety. A study by the UK National Grid suggests 
that a repeat of the 1859 event would leave 
some regions without power for several 
months. In the United States, some assess-
ments predict wide-scale disruption with rip-
ple effects lasting for years and an economic
impact of several trillion dollars.

The source of all this damage is coronal 
mass ejections (CMEs) — huge eruptions of 
magnetically charged plasma that occur dur- 
ing magnetic storms in the Sun's atmosphere 
and that increase the stream of particles in 
the solar wind by many hundreds of times. 
When CMEs reach Earth, they can inject 
large amounts of energy into the planet’s 
magnetic field, causing field variations and 
driving extra current into electricity grids. 
The storms can heat and expand the upper
atmosphere, nudging the orbits of satellites
and space debris, and can charge spacecraft 
components to dangerous voltages. 

There is some capability for forecasting 
space weather in the short term. The US 
Space Weather Prediction Center in Boul- 
der, Colorado, can provide warning for 
strong geomagnetic storms 10-60 minutes 
in advance with better than 50% reliability, 
using observations of approaching CMEs 
by NASA’s Advanced Composition Explorer
(ACE) spacecraft. That provides a small win- 
dow in which to apply countermeasures to 
protect large systems such as the power grid. 
The newer NASA Solar Terrestrial Rela- 
tions Observatory (STEREO) has shown 
that better tracking of CMEs might give reli- 
able warnings 6 hours in advance, with one- 
hour accuracy. The need to keep improving 
forecasts was shown in March 2012, when 
a major CME approaching Earth sparked 
intense media coverage. Forecasts the day 
before predicted arrival times that varied
by up to 18 hours. Many were inaccurate,
although an experimental forecast from the 
National Oceanic and Atmospheric Admin- 
istration (NOAA) in Boulder was close: just 
one hour early. In the end, that CME pro- 
duced only a modest space weather event, 
thanks to the direction of the impacting 
magnetic fields, which cannot yet be forecast. 
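
The 10-60 minute warning window can be understood with simple arithmetic. The sketch below rests on assumptions that are not in the article: that ACE sits near the L1 point roughly 1.5 million kilometres sunward of Earth, and that CME speeds span roughly 400 to 2,500 kilometres per second. Both figures are rounded, and the function name is hypothetical.

```python
# Approximate distance from the L1 point to Earth, in kilometres.
# Assumption: rounded value, used only for illustration.
L1_DISTANCE_KM = 1.5e6


def warning_minutes(speed_km_s: float) -> float:
    """Minutes between a CME passing L1 and reaching Earth at constant speed."""
    if speed_km_s <= 0:
        raise ValueError("speed must be positive")
    return L1_DISTANCE_KM / speed_km_s / 60.0


# Illustrative speeds, from slow to very fast ejections (assumed range).
for speed in (400.0, 1000.0, 2500.0):
    print(f"{speed:6.0f} km/s -> about {warning_minutes(speed):.1f} min of warning")
```

Under these assumptions, the fastest and most dangerous eruptions leave the shortest warning, which is one reason tracking CMEs earlier in their journey is attractive.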

In the long term, we still have little sense 
of what maximum space weather event we 
should prepare for. Many at-risk electrical 
systems are designed to withstand events of 
the type seen in the past 40 years or so of 
engineering: many power grids, for exam- 
ple, now require that new transformers are 
able to withstand conditions similar to those 
experienced in 1989. Last year’s earthquake 
and tsunami in Japan show the dangers of 
preparing only for an event similar to that 
seen in recent decades. Instead, we should 
prepare for a space weather event that might 
happen only once in 1,000 years. 
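
What a "once in 1,000 years" event means for planning can be made concrete with a short calculation. This is a minimal sketch that assumes each year is independent and carries the same probability of 1 divided by the return period, which is a simplification of real solar behaviour. The function name is hypothetical.

```python
def prob_at_least_one(return_period_years: float, horizon_years: int) -> float:
    """Chance of at least one event within horizon_years.

    Assumes independent years, each with probability 1 / return_period_years.
    """
    if return_period_years < 1:
        raise ValueError("return period must be at least one year")
    annual_p = 1.0 / return_period_years
    return 1.0 - (1.0 - annual_p) ** horizon_years


# A 1-in-1,000-year event over a 40-year engineering record,
# and over a full 1,000 years.
p_40 = prob_at_least_one(1000, 40)
p_1000 = prob_at_least_one(1000, 1000)
print(f"within 40 years:    {p_40:.1%}")
print(f"within 1,000 years: {p_1000:.1%}")
```

Under these assumptions, a 40-year record has only about a 4% chance of containing such an event, so recent experience says little about the worst case.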

To do this, scientists need to improve 
the availability of older space weather data, 
develop more-sophisticated models to predict 
future scenarios and ensure that the right sys- 
tems — from the electric grid to the Global 
Positioning System (GPS) time-stamps relied 
on for financial transactions — are prepared 
properly. That shift is happening, but not 
quickly enough. 


PREPARING FOR THE WORST 
The media often airs concerns about solar 
activity — there have been many stories 
about the handful of giant solar flares that 
have occurred since January, for example. 
But solar flares have a minimal effect on 
space weather; although they are a good 
indicator of the amount of energy being 
released in a solar event, they do not cause 
CMEs and are not indicators of geomagnetic 
storms. The media also tends to note that 
the Sun is heading towards a maximum in 
its 11-year activity cycle. But the Sun’s activ- 
ity is more complicated than this. Superim- 
posed on the 11-year cycle is a much longer 
cycle that has produced a series of 24 ‘grand 
maxima’ over the past 9,000 years. The 
latest grand maximum, which started in 
about 1920, is now coming to an end, so the 
next solar maximum, due in 2013 or 2014, 
should be unusually low (see ‘Solar cycles’). 

These facts should not cause complacency, 
however. Although a solar maximum might 
bring a higher frequency of more intense 
storms on average, there is no evidence that 
these long-term trends affect the intensity 
of any individual burst of solar activity — 
indeed, the 1859 event occurred outside of a
grand maximum. We need to develop safe- 
guards against the entire range of possible 
events that can be generated by CMEs. 

The first step towards this goal involves 
increasing the availability of data, so that 
statistical methods can be used to improve
estimates of worst-case scenarios. A 2011
statistical study of digital magnetometer data
taken in northwest Europe since 1980, for 
example, looked at the intensity of an event 
that is likely to occur once every 200 years. 
For the short-time-frame magnetic field vari- 
ations (in the order of minutes) that threaten 
power grids, the study estimated a worst-case 
intensity in the range of 1,000-6,000 nano- 
teslas per minute. Fortunately, this roughly 
matches the US and UK estimates of a worst 
case of 5,000 nanoteslas per minute, which 
was taken from one observation from central 
Sweden during the May 1921 event. 

Most historical data sets exist only on paper 
as charts or tables, sometimes handwritten. 
These include ionospheric data going back
80 years and magnetic data going back 170
years. Progress has been made in identifying these
historical records; they now need to be
made usable through
digitization. The advent of web-based citizen- 
science projects may offer a way forward, with 
volunteers transcribing printed or handwrit- 
ten values, or converting graphs to tables. This 
approach is already being used to analyse 
handwritten historical temperature records 
for climate-change studies. There is a large 
public appetite for this type of engagement, 
and such work can prove scientifically use- 
ful for space weather forecasting. In the Solar 
Stormwatch project, for example, volunteers 
measure features of CMEs in real time, and 
the measurements are then used to track and 
forecast the arrival of the eruptions at Earth. 
A second important step to safeguarding 
against CMEs is developing physics-based 
modelling: models that are based on a funda- 
mental understanding of CMEs, rather than 
relying solely on what has been seen before.
These models require an understanding 
of how CMEs travel through interplanetary 
space and inject energy into Earth’s 
magnetosphere, and of how the resulting 
geomagnetic storm alters the composition, 
temperatures and velocities of the upper 
atmosphere. In this way, extreme events can 
be simulated before they happen. However, 
such modelling is still in its infancy, compa- 
rable with the situation in atmospheric mod- 
elling 50 years ago. Most efforts today focus 
on specific elements in the chain. A partner- 
ship established in 2011 between the NOAA 
and the UK Met Office — part of increasing 
UK-US collaboration on space weather — is 
aiming to build a more integrated, practical 
model for forecasting these events. 

Models for space weather forecasts need 
to include long-term changes to Earth: its 
magnetic field has declined by 10% since 
1850, and the height of upper atmospheric 
layers may be changing in response to 
climate change. Physicists need to learn how 
these factors will affect space weather, and 
that can be done only through good physics- 
based modelling. 

The models are also limited by the com- 
plexity of some of the physics involved. For 
example, extreme space weather conditions 
create a strong flow of plasma of perhaps 
many kilometres per second in the iono- 
sphere. For computational efficiency, models 
generally treat this plasma as a magnetized 
fluid in thermodynamic equilibrium. This 
simple description works well when model- 
ling large-scale features, but it misses the rich 
set of plasma phenomena that arise during 
strong plasma flows. In these very energetic 
conditions, modellers should expect sig- 
nificant deviations from thermodynamic 
equilibrium. Physicists need to develop 
small-scale models that capture this full 
range of plasma physics, and then find com- 
putationally efficient ways to incorporate 
them into large-scale simulations. 


SOLAR CYCLES
The number of severe space weather storms each year (top) is related to, but not
precisely determined by, short- and long-term patterns in solar activity (bottom).

Space weather, which sometimes causes auroras on Earth, threatens to play havoc with electrical grids and to disrupt signals from the Global Positioning System.


A third, more speculative, way to safeguard 
against CMEs is to look at space weather 
around other stars. We can observe flares on 
other stars, for example, using X-ray space tel- 
escopes such as the European Space Agency's 
X-ray Multi-Mirror (XMM)-Newton. In the 
near future, we may also be able to observe 
radio and near-infrared emissions from auro- 
ras on gas-giant planets around those stars. 
That will give us insight into the power car- 
ried by their stellar winds, in particular by 
CMEs. To make a valid comparison with our 
own space weather, we will need to focus on 
Sun-like stars with similar rotation rates to 
the Sun. At present, this limits such studies to 
something between two and ten discovered 
stars. The search for such stars is already a 
hot topic in stellar astronomy, and should be 
encouraged as an area that has the potential to 
broaden our understanding of space weather. 


FUNCTIONAL KNOWLEDGE 

Alongside improvements to our knowledge 
of worst-case scenarios, we must also ensure
that crucial industries are prepared. The 
electrical power industry has a good aware- 
ness of the geomagnetic threat after the 1989 
failure in Quebec. Power grids can and do 
harden their systems by installing trans- 
formers that are more resilient to power 
fluctuations, and by having enough reserve 
generators to weather a storm. 

In aviation, severe space weather can black 
out communications links to the ground, dis- 
rupt on-board electronics and increase the 
radiation dose accumulated by aircrew and 
passengers. There is some awareness of these 
risks: airlines operating transpolar routes 
divert to longer routes at lower latitudes dur-
ing bad space weather (incurring costs of
extra fuel and crew hours). The US Federal
Aviation Administration is supporting work 
to establish international standards for space 
weather forecasts for preflight briefings. 

Further work will be needed for both 
industries as the worst-case scenarios 
become clearer. For example, the power 
sector will need to decide whether it is cost- 
effective to add expensive devices to power 
grids to block the extra currents gener- 
ated by space weather, as has been done in 
Quebec. Right now, such discussions are not 
based on sufficient facts. 

A less obvious risk is the potential loss of 
the accurate timing signals provided by GPS. 
Many industries exploit these signals: for 
example, the finance industry needs to time- 
stamp transactions with millisecond accuracy 
to support automated trading. This threat is 
easily overcome by using high-precision local 
clocks that require only occasional synchro- 
nization with GPS, perhaps once every few 
weeks. In this way, industries could survive 
a severe space weather event lasting one or 
two days. When I was preparing a report on 
solar storm impacts with Lloyd’s insurance in 
2010, it was unclear whether industries that 
rely on accurate timing are taking such steps. 
Private operators should be encouraged to be 
open about their efforts to safeguard against 
space weather. 
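
How good a local clock must be to ride out a GPS outage is a matter of simple arithmetic. The sketch below assumes a constant fractional frequency offset, so that timing error grows linearly with elapsed time; real oscillators also age and respond to temperature. The function names and the 21-day figure standing in for "a few weeks" are illustrative assumptions.

```python
SECONDS_PER_DAY = 86_400.0


def max_fractional_offset(error_budget_s: float, holdover_days: float) -> float:
    """Largest constant fractional frequency offset that keeps the
    accumulated timing error within error_budget_s over holdover_days."""
    return error_budget_s / (holdover_days * SECONDS_PER_DAY)


def accumulated_error_s(fractional_offset: float, elapsed_days: float) -> float:
    """Timing error in seconds after elapsed_days at a constant offset."""
    return fractional_offset * elapsed_days * SECONDS_PER_DAY


# Millisecond accuracy held through a two-day storm, and through
# an assumed 21 days between GPS synchronizations.
budget_s = 1e-3
storm = max_fractional_offset(budget_s, 2.0)
weeks = max_fractional_offset(budget_s, 21.0)
print(f"two-day holdover needs offset below {storm:.2e}")
print(f"21-day holdover needs offset below {weeks:.2e}")
```

On these assumptions, the requirement is a few parts in a billion for a two-day outage and a few parts in ten billion for a three-week interval.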

National organizations that assess space 
weather risks include the UK Civil Con- 
tingencies Secretariat and the US Federal 
Emergency Management Agency. Several 
international bodies also have growing 
space weather efforts, including the World 
Meteorological Organization in Geneva, 
Switzerland, and the European Com-
mission, the latter of which held a Space
Weather Awareness Dialogue in October
2011. We should combine these efforts 
through an international network, bring- 
ing together all aspects of risk and implica- 
tion assessments, operational responses and 
policy development. 

This area of science has moved away from 
its roots in astronomy and communication 
engineering and is now better handled as a 
generic environmental risk to society and 
the economy, in parallel with earthquakes, 
volcanoes and floods. National and interna- 
tional funding organizations are increasingly 
transferring relevant funds to environmental 
research bodies. This is a welcome develop- 
ment that should be encouraged. 

Geomagnetic storms pose a serious 
threat to our highly vulnerable, technology- 
dependent society. We need a much better 
understanding of the likelihood of space 
weather disruptions and their impacts, and 
we need to develop that knowledge quickly.


Mike Hapgood is head of the space 
environment group with RAL Space, part 
of the UK Science and Technology Facilities 
Council. He is based at the Rutherford 
Appleton Laboratory (RAL) in Didcot, UK, 
and chairs an expert group that advises the 
UK government on space weather risks. 
e-mail: mike.hapgood@stfc.ac.uk

Leonardo’s anatomy years


A London exhibition will expose the Renaissance master’s staggering medical 
discoveries, which languished unpublished for centuries, explains Martin Clayton. 


Leonardo da Vinci is the archetype of
the Renaissance man, but since his day
he has been seen primarily as a painter
who dabbled in the sciences. Leonardo would 
not have recognized this image: his scientific 
studies were as important to him as his art. 
Of all his investigations — which included 
optics, geology, botany and hydrody- 
namics — the field that engaged him 
most was human anatomy. 

Eighty-seven of Leonardo’s finest 
sheets of anatomical study from the 
UK Royal Collection's total of more 
than 550 by the artist are to go on 
show from May at the Queen's Gallery 
in London. They range from a fetus 
in the uterus to the vertebral column, 
and will constitute the largest-ever 
exhibition of Leonardo’s anatomical
work, and the first in the United King- 
dom since 1977. 

The exhibition represents two cam- 
paigns of intense work around 1490 
and between 1507 and 1513, during 
which Leonardo dissected around 
30 human corpses. He fully intended 
to publish his findings, and had he 
done so, he would have transformed 
the study of anatomy in Europe. 
Many of the bodily structures that 
he depicted would not be described 
again for centuries. But Leonardo’s 
perfectionism, his difficulties in rec- 
onciling his observations with estab- 


| eonardo da Vinci is the archetype of 


studio of the innovative sculptor and painter 
Andrea del Verrocchio, where he picked up 
the rudiments of engineering. Then, in the 
1480s, Leonardo moved to Milan, where his 
range of interests widened at a remarkable 
rate and he began to compile material for a 
treatise on the theory of painting. 
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early life. The illegitimate son of a 
notary and a peasant girl, he was 
born in 1452 near the town of Vinci 
in Tuscany, Italy. He learned to read 
and write but his arithmetical skills 
were shaky, and although he taught himself 
Latin as an adult, he was never comfortable 
with the language of most contemporary 
scientific writings. He was to portray this 
as an advantage — claiming that he was a 
“disciple of experience” unencumbered by 
ancient belief — but in reality his position 
outside the scientific-philosophical main- 
stream would deny him the range of contacts 
and resources that his peers enjoyed. 

In his twenties, Leonardo joined the paint- 
ers’ guild in Florence and worked in the 
Figure 1 | The skull sectioned (1489), one of Leonardo da Vinci’s
earliest anatomical studies, did not yield the insights that he craved. 


Leonardo had come to see painting as a 
scientific activity, in which every effect (light 
and shade, colour, perspective) and form 
should be based on a true understanding of 
nature. The human body was the principal 
subject of the Renaissance artist and Leonardo
soon realized
workings of the mind,
so that a painting would reveal the 
emotions of the protagonists and 
the human drama of the scene. 
Leonardo thus aimed to under- 
stand the perception of reality 
through the senses, the struc- 
ture of the mental faculties and 
how the nerves configure the 
muscles and bones. How could 
he even begin to investigate 
these topics? Human dissection 
was not banned, as is often sup- 
posed; indeed, a papal bull of 
1482 expressly permitted it. But 
Leonardo was a mere craftsman, 
and — then as now — a craftsman
could not simply acquire a corpse
and start cutting it up. Instead,
he was reliant at first on animal
dissection, traditional belief and
simple speculation.
In 1489, Leonardo obtained a
human skull. He sectioned it to
investigate its internal structure,
recording his findings in exquisite
drawings in a notebook annotated 
in his habitual mirror writing 
(Fig. 1). But the shape of the skull 
(and by implication of the brain) 
failed to provide Leonardo with 
any useful information about the 
relationship between mind and 
body. The gulf between Leon- 
ardo’s ambitions and his achieve- 
ments brought his first wave of 
anatomical research to a halt. In his 
mural The Last Supper, finished in the 1490s, 
the disciples’ poses and expressions convinc- 
ingly capture their varied emotions, but owe 
nothing to his anatomical investigations. 
Leonardo returned to the subject years 
later, in the wake of a commission to paint a 
huge battle scene in the Palazzo della Signo- 
ria in Florence. In preparation for the project 
he executed many drawings of male muscu- 
lature, some so detailed that they can only 
have been based on flayed bodies. Leonardo 
was then in his fifties and one of the most
celebrated artists in Italy; at this point in
his career, he seems to have had little difficulty
in obtaining permission to dissect
human corpses.

Figure 2 | With The muscles of the shoulder and arm, and the bones of the foot (c.1510-11), Leonardo sought to explore the dynamic interplay of
bodily elements that underlay the poses of human models in his paintings.

In the winter of 1507-08, Leonardo wit- 
nessed the peaceful demise of an old man in 
a hospital in Florence, and wrote in his note- 
book that he performed a dissection “to see 
the cause of so sweet a death”. He attributed
it to a narrowing of the coronary vessels, and 
wrote the first clear description of atheroscle- 
rosis in medical history. He also described the 
pathology of cirrhosis of the man’s liver, which 
he found to be “desiccated and like congealed 
bran both in colour and substance”. 

The dissection of the old man marked 
the beginning of five years of intense anatomical
investigation, and in 1510-11
Leonardo seems to have collaborated with
Marcantonio della Torre, the professor of 
anatomy at the University of Pavia. 

Marcantonio provided ready access to 
human material, and Leonardo may have 
dissected up to 20 corpses that winter. He 
concentrated on the bones and muscles, 
analysing their structure in purely mechani- 
cal terms, and the results were spectacular 
(Fig. 2). Perhaps encouraged by the profes- 
sional anatomist, Leonardo illustrated every 
bone except those of the skull, and most of 
the major muscle groups. The completion 
of his treatise was within reach, and on one 
drawing he wrote: “This winter of 1510 I
believe I shall finish all this anatomy”.

But it was not so. In 1511, Milan descended 
into military turmoil. Marcantonio died of
the plague, and Leonardo retired to the country
villa of his assistant, Francesco Melzi.

Figure 3 | The findings recorded in The aortic valve (c.1512-13) went unconfirmed for more than 400 years.

With the loss of his supply of human mate- 
rial, Leonardo reverted to the study of ani- 
mal anatomy — most impressively the ox’s 
heart, which differs little in structure from 
that of a human. Leonardo described the 
ventricles and atria with great accuracy, and 
analysed the structure and functioning of the 
valves in minute detail. In a brilliant experi- 
ment, he made a glass model of the aortic 
valve (Fig. 3), through which he pumped 
water mixed with grass seeds to study the 
vortices in the widening of the aortic root. 
He deduced that these vortices were crucial 
in closing of the valve — a finding that was 
confirmed only in the twentieth century. 

Leonardo had an almost perfect under- 
standing of the physiology of the human 
heart. But he had no inkling of the circu- 
lation of the blood, and the existence of 
one-way valves was incompatible with the 
ancient belief that the heart simply churned 
blood in and out of the ventricles, thus generating
heat and ‘vital spirit’. Unable to reconcile
what he had observed with what he
believed to be true, Leonardo reached an 
impasse. He became trapped in describing 
the motion of the blood through the valves 
in ever more detail. And there, it seems, his 
anatomical work came to an end. 

There is no sign that Leonardo attempted 
to collate his research for publication. On his 
death in 1519 he left his papers to Melzi, and 
although the anatomical studies were mentioned
by all Leonardo’s early biographers,
their dense and disorganized content was 
barely comprehended. Unpublished, the 
studies were effectively lost to the world. 
Elsewhere, anatomical exploration gained 
pace, and in 1543 Andreas Vesalius published 
his epochal De humani corporis fabrica (On 
the fabric of the human body), a model of 
what Leonardo’s treatise could have been.

The 150 surviving sheets of Leonardo's 
anatomical studies reached England in the 
seventeenth century and eventually made 
it into the Royal Collection, bound into an 
album with 450 of his more artistic draw- 
ings. But it was not until 1900 that they were 
finally published and understood. By then, 
their power to affect the progress of anatom- 
ical knowledge had long passed. 

Leonardo’s paintings changed the course
of European art; but his anatomical investi- 
gations, the finest of their age, were essen- 
tially unknown. This exhibition will give an 
unprecedented chance to assess this most 
paradoxical of scientists.


Martin Clayton is Senior Curator of Prints 
and Drawings at The Royal Library, The 
Royal Collection, Windsor Castle, Windsor, 
Berkshire, SL4 1NJ. 

e-mail: martin.clayton@royalcollection.org.uk
Work-life balance


John Whitfield gauges a study that tackles the 
evolutionary conflicts behind workplace inequity. 


My grandmother was a cleaner; my
mother was a medical doctor. That
is a single example of how, in the
course of a few decades, women’s educa- 
tional and professional opportunities have 
increased vastly. As of 2010, about 60% of 
US women were in the workforce, making up 
nearly half of all workers; the picture for most 
other developed nations is similar. Women 
can do just about any job, and in many coun- 
tries overt discrimination is illegal. 

But other statistics tell a different story. In 
2010, women made up just 1 in 7 company 
board members and 1 in 40 chief executives 
of Fortune 500 companies. And women still 
earn less than men for the same jobs. In the 
United States and Britain, a female lawyer 
earns, on average, about three-quarters of her 
male counterpart’s salary. For newly qualified 
US medical doctors, the pay gap is widening: 
17% in 2008, up from 12.5% a decade before. 

In The War of the Sexes, Paul Seabright, an 
economist working at his subject’s bound- 
ary with evolutionary biology, argues that 
human evolution can help to explain the 
inequities of today’s white-collar, Western 
workplace. It is a story well worth hearing, 
but it is incomplete — partly because there is 
little decisive evidence for what causes gender 
gaps in employment, and partly because of 
some curious authorial choices. 

The first half is an excellent primer on why
males and females have evolutionary cause 
to behave in different ways. Sperm, Seabright 
explains, are cheap and plentiful; eggs, the 
opposite. This tends to make females choosy 
about their mates, and males less so. So 
males must compete for female attention by 
fighting among themselves, or by seducing 
prospective mates with advertisements of 
quality, such as gifts of food. Such differences 
in sexual supply and demand create conflicts 
of interest and incentives for deceit. 

Paradoxically, our cooperative natures 
exacerbate these conflicts. Collaboration in 
child-rearing is key in humans because of the 
long childhood needed to grow a big brain. 
Men can provide resources such as meat, but 
in return, fathers and husbands have sought 
to control what (or, crudely but more accu- 
rately, who) women did. 

In the second half of the book, Seabright 
investigates how such conflicts might 
explain the dearth of women at the top of 
the career ladder. Using evidence drawn 
mostly from the upper echelons of business,
Seabright suggests two causes of inequality.

Despite a rise in professional opportunities for
women, many are still paid less than men.

First, anyone who takes a career break
suffers for it. Even years after returning to
work, he or she can expect to be paid less
than a colleague who stayed put. So, Seabright
argues in a claim that is likely to be controversial,
there is no systematic discrimination
against women here — it is just that they tend
to sacrifice office-time for child care. The cost 
of this, he says, results from a “signalling trap”: 
to get to the top, an employee must work all 
hours, not because it gets the job done, but 
because bosses take such behaviour as a sign 
of quality and commitment. Like a peacock 
investing in a huge tail to show off his good 
genes at the cost of his flying ability, this is 
hugely inefficient; but anyone who opts out 
unilaterally pays a disproportionate cost. 
Second, Seabright argues that women are 
disadvantaged by networking behaviour. 
Each sex, for example, prefers to network 
with its own kind. In a male-led workplace, 
therefore, men’s social and professional net- 
works tend to overlap more than women’s, 
so women are less likely to find professional 
opportunities. The evidence for this is more 
preliminary than that for the effect of career 
well connected.

Throughout the 
book, Seabright is 
terrific company 
— entertaining and 
convincing. In a debate in which both sides 
tend to fit the evidence to their ideology, he 
lets the data take the lead. His recommen- 
dations for a more equal workplace, such 
as compulsory sex balance on job shortlists 
and mandatory paternity leave, are sensible 
and modest. Yet as a whole, The War of the 
Sexes doesn't quite hang together. 

The first half shows that natural selec- 
tion has given men and women different 
priorities, and so different strategies. But I 
struggled to make the leap from the eco- 
nomics of sex to the economics of work, 
including the modern pay gap. I would have 
liked to see a chapter stopping off midway 
between prehistory and the present, to look 
at how developments such as agriculture 
affected men and women. As Seabright 
notes, the biological similarities between the 
sexes show that women’s “subordinate and 
dependent” state must be a relatively recent
development — in which case, it seems odd 
to rely so heavily on the distant past. 

In particular, the missing 10,000 years 
leave Seabright nowhere to tackle sexism 
and bullying. He assumes that people are 
rational and well-meaning. There is little dis- 
cussion of cultural barriers to equality, such 
as male bosses’ desire to keep women subor- 
dinate and dependent even if it costs them 
money. This is where those at the sharp end 
of the problem lay the blame: last month, a 
survey of women in UK banking found 
that three-quarters think that the biggest 
hurdle to equality is the attitude of senior 
male managers. 

If this were a book by a science writer, I 
would no doubt be moaning that it sought 
to explain all of human life by shoehorning 
it into one grand idea. Because it is a book 
by an academic, I’m moaning that it under- 
coheres, that the circumspection and cave- 
ats impede its argument. This is a dispatch 
from a field in its infancy, and the gaps in its 
thesis reflect the holes in our knowledge. 
But with people such as Seabright working 
to fill them, we can at least be optimistic that 
we will eventually understand what causes 
inequality, if perhaps less positive that we 
can fix it.


PAUL SEABRIGHT 
Princeton University 
Press: 2012. 256 pp. 
$24.95, £16.95 


John Whitfield is a writer based in London. 
Keep a way open for
tailored treatments 


Two documents with crucial
implications for the future
of personalized medicine
were released last month: a
report from the US Institute of 
Medicine (IOM) emphasizing 
the importance of open data 
sharing (see go.nature.com/ 
hofgoc) and a US Supreme 
Court decision ruling against 
two patents (see go.nature.com/ 
]j9kyl), which could have the 
opposite effect. 

This creates a quandary for 
companies aiming to pursue 
personalized treatments or 
methods, because those that 
adopt the IOM’s advice to share 
their information now face 
greater hurdles to protecting their 
contributions through patents. 

Companies should not be 
discouraged from exploring 
personalized medicine — to 
do so would deny society the 
benefit of tailored therapies. 
Likewise, firms that prefer to 
keep personalization algorithms 
secret will hinder scientific 
progress. 

Instead, companies could sell 
value-added products that use 
open algorithms and statistical 
models — much like Google and 
IBM, for example, which have 
embraced open-source software. 

To deliver effective treatments 
to patients, we need to find 
a way to reward companies 
for investing in personalized 
medicine while allowing the 
scientific community to advance 
knowledge rapidly. 

Jeffrey T. Leek, Roger D. Peng 
Johns Hopkins Bloomberg School 
of Public Health, Baltimore, 
Maryland, USA. 
jtleek@gmail.com 

R. Reeves Anderson Arnold & 
Porter LLP, Washington DC, USA. 


Boost water safety in 
rural China 


The world is well on its way 
to meeting the Millennium 
Development Goal to halve by
2015 the proportion of people
who in 1990 had no sustainable 
access to safe drinking water 
(Nature 483, 128-129; 2012). 
But more resources are still 
needed to improve water 
quality in China, particularly in 
rural areas. 

According to a report released 
last month by the World Health 
Organization (WHO) and the 
United Nations Children’s Fund 
(UNICEF), China contributed
an estimated 457 million people 
to the 2 billion who by 2010 
had access to improved water 
sources. 

However, these ‘improved’ 
sources (a piped supply 
or borehole, for example, 
as opposed to a stream or 
unprotected well) do not always 
supply safe water, as judged by 
China’s national standards for 
water quality. 

Although 85% of China's 
rural population (605 million 
people) had access to improved 
water sources in 2010, only 
around one-quarter of these 
sources would be considered 
safe by national standards 
(R. Zhang et al. J. Environ. 
Health 26, 3-5; 2009). By 
comparison with 1990 figures, 
this implies that only about 
35 million people in rural 
China had gained access to safe 
water by 2010. For piped urban 
water, however, almost 91% 
of the water supply complied 
with chemical and microbial 
standards (J. Zhang et al.
J. Shenyang Agric. Univ. 34,
460-463; 2003), so roughly 
295 million urban Chinese 
gained access to safe water over 
the same period. 

The total Chinese population 
with access to safe water is 
therefore 330 million at most. 
This is about 127 million fewer 
people than estimated by the 
WHO/UNICEF report, a figure 
that is roughly equivalent to the 
population of Japan. 
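
The arithmetic behind these totals can be checked with a short sketch. It uses only the figures quoted in this letter, in millions of people; the variable names are illustrative and are not taken from the WHO/UNICEF report.

```python
# Figures quoted in the letter, in millions of people.
# Variable names are illustrative assumptions, not terms from the report.
who_unicef_estimate = 457  # people credited with gaining 'improved' water sources by 2010
rural_safe_gain = 35       # rural Chinese estimated to have gained safe water
urban_safe_gain = 295      # urban Chinese estimated to have gained safe water

# Upper bound on the population that gained access to safe water.
total_safe_gain = rural_safe_gain + urban_safe_gain

# Gap between the 'improved source' estimate and the 'safe water' estimate.
shortfall = who_unicef_estimate - total_safe_gain

print(f"Gained safe water: {total_safe_gain} million")
print(f"Shortfall against the WHO/UNICEF estimate: {shortfall} million")
```

The sum reproduces the 330 million total and the difference reproduces the 127 million shortfall stated above.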

Hong Yang, Jim A. Wright 
University of Southampton, UK. 
hongyanghy@gmail.com 
Stephen W. Gundry University 
of Bristol, UK. 
Soil remedies for
small-scale farming 


In considering how best to 
improve poor agricultural soils 
in parts of Africa and improve 
crop productivity (Nature
483, 525-527; 2012), micro-dosing
would be a practical and
inexpensive way to balance soil 
nutrients. 

Developing countries tend 
routinely to use fertilizers 
containing only nitrogen, but a 
balanced supply of nutrients is 
essential for a crop to flourish. 
Urea application doubled in 
Africa during 1975-2005, 
causing an increase in the 
ratio of applied nitrogen to 
phosphorus and potassium 
(see go.nature.com/eanpxq). 
Although urea is cheap and 
its use is tempting to poor 
farmers in need of a rapid return
on investment, it selectively 
depletes soil phosphorus, 
potassium and micronutrients. 
Ultimately, this imbalance will 
render the nitrogen ineffective. 

Micro-dosing soil with 
small quantities of correctly 
proportioned ‘NPK’ fertilizer 
would significantly increase 
crop yields for smallholder 
farmers who are unable to 
afford large-scale applications. 
This simple technique involves 
dispensing measured amounts 
of the fertilizer using a bottle 
cap, for example, into planting 
holes during sowing. The 
returns will more than justify the 
small investment. 

Marijn van der Velde, Linda See, 
Steffen Fritz International Institute 
for Applied Systems Analysis, 
Laxenburg, Austria. 
velde@iiasa.ac.at 


Getting a handle on 
biological data 


We invite interested parties to 
help establish an infrastructure 
to improve the accessibility of 
the ever-increasing volumes of 
biological data. Our objective 
is to prepare a white paper this 
autumn for consideration by 
the European Commission's
Horizon 2020 funding call in 
2013, among others. 

New paradigms are needed 
to manage and process 
biological data. Good metadata 
descriptions for each data set 
will help users to locate the 
required facts and to make 
the best use of biological 
information. 

Other infrastructural 
projects already exist, 
including the International 
Nucleotide Sequence Database 
Collaboration, the Catalogue of 
Life and the Global Biodiversity 
Information Facility, but they 
do not yet enable access to the 
broad swathes of data currently 
available. 

Members of the community 
who are involved in generating 
and using biological data need 
to establish priorities so that 
funders can then make targeted 
grant calls, similar to the 
ten-year programme reviews 
established in the physical 
sciences. 

We have initiated a 
community consultation as an 
open document (see go.nature.com/tma6gm)
that describes
a series of topics. Volunteers 
are invited to open similar 
documents for each topic, or to 
create a new topic, by adding 
their name and the document 
URL to the main index. A 
public meeting on the initiative 
will be held in Brussels on 
17 July this year. 

Dave Roberts The Natural 
History Museum, London, UK. 
dmr@nomencurator.org 

Alex Hardisty Cardiff University, 
Cardiff, UK. 
Search for the compass needles


The idea that bird orientation is guided by magnetic-sensing structures in the animals’ beaks has been challenged by the 
suggestion that the iron-containing cells are macrophages, which have no link to the brain. SEE LETTER P.367 


HENRIK MOURITSEN 


Most humans are hopeless navigators
unless they have a map or the Global
Positioning System to hand. By
contrast, it is well known that many animals,
including birds, use information from Earth’s
magnetic field to orient themselves, even over
thousands of kilometres. But how do they
sense the field? Iron-mineral-based structures
within nerve cells in birds’ upper beaks were
previously proposed to act as one of two main
magnetic sensors in birds, the other being a
light-based system in the retina. However, on
page 367 of this issue, Treiber et al. report that
these iron-mineral deposits are in fact located
inside macrophages, a kind of immune cell that
is also involved in iron homeostasis.

When people are asked how they think 
birds can sense magnetic fields, most suggest 
that birds must have small ‘compass needles’ 
somewhere in their body. The iron-containing 
minerals magnetite (Fe₃O₄) and maghemite
(Fe₂O₃), which are found in various parts
of a bird’s body, are particularly well suited 
to sensing magnetic fields. However, these 
minerals can be relevant to magnetic sensing 
only if they are associated with nerve tissue and 
found in the same location in all individuals of 
a given species. In a landmark paper in 2003, 
Fleissner et al. reported that structures containing
magnetite ‘spherules’ and maghemite
‘platelets’ are consistently present at six specific
locations along the upper beaks of pigeons
(Fig. 1a). These researchers suggested that
the structures are magnetic sensors located
in dendrites — nerve-cell endings. The structures
included a large substructure that the
researchers described as a vesicle.

If these structures sense the magnetic field, 
this information must be transmitted to the 
brain via the ophthalmic branch of the trigemi- 
nal nerve, as this is the only nerve that enters 
a bird’s upper beak. Indeed, when this nerve 
is cut, pigeons trained to detect a strong magnetic
field can no longer perform this task.
Furthermore, in European robins, which are 
migratory birds, neurons in the brain regions 
that receive direct input from the trigeminal 
nerve are activated by a changing magnetic- 
field stimulus, and this activation disappears if 
the trigeminal nerve is cut or if the magnetic
field is removed. Therefore, even though there
was no direct proof for their function, the iron-mineral
structures described by Fleissner and
colleagues have been generally assumed to
be the primary magnetic receptors associated
with the trigeminal magnetic sense in birds.

320 | NATURE | VOL 484 | 19 APRIL 2012

[Figure 1 panels: a, dendrite containing magnetic sensor unit (vesicle, magnetite spherules, connection to rest of nerve cell); b, macrophage (ferritin-containing siderosomes, nucleus).]

Figure 1 | Ironing out magnetoreception. a, Fleissner and colleagues
previously proposed that
birds’ beaks contain dendrites (nerve-cell endings) harbouring iron-rich structures that can sense Earth's
magnetic field and transmit this information to the brain, thereby helping the bird to navigate. By using
electron and light microscopy to observe tissue sections stained with the iron-marking dye Prussian blue
(blue colour), the authors identified magnetite-containing ‘spherules’, maghemite-containing ‘platelets’
(too small to depict) and a large substructure that they defined as a vesicle. They classified the structures
as being clustered at six defined points along the upper beak. b, Treiber et al. now propose that these
iron-mineral deposits are instead located in macrophages, immune cells that have a type of organelle
(siderosomes) containing the iron-storage protein ferritin, and that the substructure is the cell’s nucleus.
The authors have found that these iron-containing cells are diffusely distributed throughout the beak
and in other parts of the bird’s body.

Treiber et al. challenge the interpretations
of Fleissner and colleagues. On the
basis of data that they collected from almost
200 pigeons, Treiber et al. propose that the
iron-mineral-containing structures reported
by Fleissner et al. are macrophages (Fig. 1b),
not magnetic receptor cells. They show that the
structures are located in variable quantities at
inconsistent locations along the upper beak,
and also elsewhere in the pigeon’s body, such as
in the respiratory tract and skin, and that they
are not associated with nerve tissue. They also
suggest that the spherules, which are strongly
stained by the iron-marking dye Prussian blue,
are in fact organelles called siderosomes, which
contain the iron-storing protein ferritin, and
that the large, round substructure is a nucleus,
not a vesicle.

Whose interpretation is correct? The answer 
to this question will be extremely important 
for magnetic-sense research. To form my 
own opinion, I examined many of the origi- 
nal microscope slides that make up the basis 
of Treiber and colleagues’ report, and compared
them with my previous observations of
some of the original slides that contributed to
Fleissner and colleagues’ studies. In my view,
a significant proportion of the iron-mineral-containing
structures found by Treiber et
al. seem to be identical to those previously
reported, even though some of the structures
classified as such by Treiber et al. would,
I believe, be regarded as artefacts by Fleissner
and colleagues. However, even when I consider
only those structures that look identical,
these certainly seem to occur at many more
than six specific locations along the beak.
Indeed, Treiber and colleagues’ laborious
quantification shows that the occurrence of the
iron-mineral-containing structures is highly
variable between individual birds. By contrast,
Fleissner and colleagues’ claim of six specific
locations is not supported by quantitative data.
This lack of consistent distribution makes it
highly unlikely that the iron-rich cells are part
of a magnetic sensory system.

The most defining difference between 
macrophages and dendrites is that macro- 
phages have a nucleus and dendrites do 
not. Treiber and colleagues’ data and origi- 
nal slides convincingly show that the iron- 
containing structures are nucleated and that 
there is no regular co-localization between 
Prussian-blue staining and nerve fibres. The
structures, therefore, cannot be located in
dendrites. Furthermore, the authors report
95-99% co-localization of staining with Prussian
blue and a macrophage marker, strongly
indicating that most, but maybe not all, of the 
described structures are macrophages. 

In conclusion, I find that serious doubt has
been raised about the original proposals
and that the burden of proof now lies with
Fleissner and colleagues, if they still think that
their structures are potential magnetic sensors.
However, it is important to stress that Treiber
and colleagues’ results cannot exclude the
possibility that there are iron-mineral-based
sensors somewhere in the upper-beak region
of pigeons. Only relatively few sensors may be
needed. In fact, a magnetite-based sensory cell
might contain only a few magnetite crystals
and thus evade detection by Prussian-blue
staining, the method used by both groups.
The implications of this new report are
dramatic. The evidence supporting a magnetic
compass embedded in birds’ visual systems
remains unaffected, but we are left with
only two studies that clearly implicate the
ophthalmic branch of the trigeminal nerve,
although not necessarily iron-related structures,
in magnetic sensing in birds. Therefore,
for now, we have evidence supporting only a
trigeminal-nerve-related magnetic sense in
birds, not an iron-mineral-related magnetic
sense. Both the biological function and basic
sensory origin of the trigeminal magnetic
sense are now unknown. Understanding the
magnetic senses in animals indeed remains a
formidable scientific problem.


ELECTRONICS


Carbon nanotubes
finally deliver


A carbon-nanotube transistor has been made that performs better than the best
conventional silicon analogues. The result propels these devices to the forefront
of future microchip technologies.


FRANZ KREUPL


Every time you turn on your computer
to browse the Internet or write a presentation,
billions of tiny switches called
transistors orchestrate a dance of electrons in
the computer’s processor that is tuned to the
rhythm of the underlying software. By letting
electric current flow or not, the transistors,
which are made of silicon, create the states
(the ‘zeros’ and ‘ones’) on which our Boolean
computer world is currently based. Writing in
Nano Letters, Franklin et al. describe how they
have created a transistor based on carbon that
promises to outperform its silicon counterparts
in future electronic circuitry.

The improvement in the look and feel of
computers over the past decades — as they
turned from clunky, nerdy objects into stylish
high-tech gadgets — is partly due to the
fact that silicon transistors have shrunk
in size from tens of micrometres to about
20-30 nanometres. However, further scaling
down of this technology is becoming more and
more challenging. That is why researchers have
been looking for a successor to silicon transistors
that would allow further miniaturization
— hitherto with limited success.

Franklin and colleagues’ study represents
progress on this front. The authors show that
single-walled carbon nanotubes (SWCNTs)
— semiconductors that have been under close
scrutiny for more than a decade — can operate
as excellent switches at molecular dimensions
of only 9 nm (Fig. 1). This is less than half the
size of the leading silicon technology. And the
authors’ transistor delivers more current and
requires less operating power than do silicon-based
competitors of equivalent size. This is
the first experiment to clearly demonstrate that
silicon could be replaced with carbon material
in future semiconductor technology.

To operate in the same way as the usual, larger
silicon transistors, SWCNT transistors must
show what is known as ‘current saturation’ at
low applied voltage. This means that the devices’
output current should hardly be affected if the
applied voltage is varied. Exactly this behaviour
is demonstrated by the authors’ device at a supply
voltage of less than 0.4 volts, lower than the
0.64 V that the roadmap’ of the semiconductor 
industry postulates for a device size of 
9 nm, scheduled to reach the market in 2022. 

Early attempts’ to look at the scalability of 
SWCNT transistors revealed that an 18-nm- 
long SWCNT device showed no current 
saturation — the current output was simply 
proportional to the applied voltage. But it was 
unclear whether this was an inherent feature of 
ballistic electron transport (in which no elec- 
trons are scattered) or due to the electrostatic 
design of the device*. Franklin et al.’* investi- 
gated this issue in more detail, and improved the 
current-saturation effect by employing a better 
electrostatic design than that used previously. 

In an earlier paper’, the authors were able 
to demonstrate that the use of a 10-nm-thick 
insulating layer of hafnium oxide, in place 
of the layer of silicon oxide used in the 18-nm 
SWCNT device, gives a much stronger 
electrostatic coupling between the device’s 
gate electrode and the SWCNT, and enables a 
20-nm-long SWCNT transistor to be created 
that has good current-saturation behaviour. 
However, a 15-nm-long SWCNT device, which 
was fabricated in the same batch of transistors, 
did not show this promising signal because
the electrostatic coupling was apparently 
weakened by the shorter distance between the 
device’s source and drain electrodes. 

The transistor that Franklin et al. now report
is based on an electrostatic design involving
a 9-nm-long channel made of a SWCNT
and a hafnium oxide insulating layer that is
only 3 nm thick. The reduced thickness of
the insulating layer produced much stronger
electrostatic coupling between the short
SWCNT channel and the gate electrode than
is achieved with a thicker oxide. What’s more,
the authors show that the strong electrostatic
coupling enables the device to fulfil another
requirement of an electronic switch: it can
‘turn off’ the current to values that are low
enough for applications.

Figure 1 | A carbon-nanotube transistor. Franklin and colleagues’ transistor consists of a 9-nanometre-
long channel made of a single-walled carbon nanotube (SWCNT), which is a single layer of carbon atoms
arranged in a honeycomb pattern and rolled into a seamless cylinder about 1 nm across. When a voltage is
applied to the transistor’s gate electrode, electric current flows from the source electrode to the drain electrode.
A 3-nm-thick layer of hafnium oxide allows strong electrostatic coupling between the gate and the SWCNT.

Although transistors operate as switches, 
they can never turn the current off completely. 
The question is, how much can the current be 
reduced? In Franklin and colleagues’ transistor, 
the off-current was 10,000 times smaller than
the current when the channel was open. And it may be
possible to suppress the current even further by 
deploying a better electrostatic design than that 
used by the authors. Although carbon nano- 
tubes were predicted a decade ago to be the way 
forward for microelectronics, direct experimen- 
tal proof that they can be used to make transis- 
tors at molecular scales was missing until now. 

Franklin and colleagues’ transistor is good
news, because power consumption has an ever-
increasing role in electronics, and the device's
ability to operate at small supply voltages
should mitigate the demand in energy
consumption in future microchips. However,
the road to a real application is bumpy and
beset with challenges. Some of these chal-
lenges, such as the reliability and variability of
the transistors, could be solved by sophisticated
engineering methods. But others, including
the creation of pure batches of semiconduct-
ing nanotubes, are of a more fundamental
nature. SWCNTs can exist in ‘flavours’, each of
which displays different electronic properties.
If billions of SWCNTs are needed on a chip,
they must all have the same flavour, which is
currently difficult to attain.

Nonetheless, if the progress made in the
field of nanotubes during the past decade
is anything to go by, the prospects for solv-
ing these fundamental problems look good.
Franklin and colleagues’ results will certainly
stimulate further research to unlock more
secrets of carbon nanotubes.


Fossil raindrops
and ancient air


An analysis of fossil imprints of ancient raindrops suggests that the density of
the atmosphere 2.7 billion years ago was much the same as that today. This result
casts fresh light on a long-standing palaeoclimate paradox. SEE LETTER P.359


WILLIAM S. CASSATA & PAUL R. RENNE


When a main-sequence star such as
our Sun ages, its inner core becomes
denser and the temperature at which
its hydrogen is fused to helium increases. As a
result, the Sun is currently more luminous, and
delivers more energy to Earth’s surface, than
in the past; its energy output around 2 bil-
lion years ago is inferred to have been less than
85% of that today. Such a faint Sun should not
have been able to warm Earth's surface above
the freezing point of water, yet the geological
record indicates that liquid water abounded at
that time. The apparent contradiction between
theory (sub-freezing Earth surface tempera-
tures) and observation (liquid water) is known
as the ‘faint young Sun’ paradox. On page 359 of
this issue, Som et al. address this paradox using
seemingly unlikely evidence: fossilized imprints
left by ancient raindrops*.

There are ample indications of liquid
water under the faint Sun. In South Africa,
for example, exposed rocks more than 3 bil-
lion years old contain features associated with
flowing or standing water, including sedimen-
tary deposits that preserve ripple marks and
mud cracks, glassy ‘pillow’ lavas that were
rapidly quenched and alga-like microfossils.
Indirect evidence, meanwhile, extends the
record of terrestrial oceans to as early as 4.4 bil-
lion years ago. So how can this be explained?

*This article and the paper under discussion were
published online on 28 March 2012.

Solutions to the paradox fall into two gen- 
eral categories: those contending that Earth’s 
atmosphere retained heat more efficiently in 
the past than it does now, for example because 
of increased concentrations of greenhouse 
gases, and those arguing that the albedo, or 
reflectance, of Earth was lower in the past, 
perhaps because there were fewer clouds 
and/or less ice. Most models used to explain 
the paradox are purely theoretical, and were 
designed to highlight the conditions necessary 
to solve it. Unfortunately, few observational 
constraints to support or refute these mod- 
els have been identified, and those that have 
been proposed’ tend to be controversial. 

Som et al. have implemented a clever
approach (first attempted in 1851 by a pioneer
of geology, Charles Lyell) to determine
the density of the atmosphere early in Earth’s
history. This information is crucial for assess-
ing whether greater concentrations of green-
house gases (such as carbon dioxide), or of
other gases (such as nitrogen) that amplify
the effects of greenhouse gases, could explain
the faint young Sun paradox.


W. ALTERMANN/UNIV. PRETORIA 


Specifically, the authors inferred the velocity 
of falling ancient raindrops from the geometry 
of fossilized raindrop-impact marks preserved 
in a 2.7-billion-year-old sedimentary rock 
from South Africa (Fig. 1). The atmosphere 
exerts a drag on raindrops such that they typically
fall at a terminal velocity that decreases as
the density of the atmosphere increases.
Any difference between the inferred velocity 
of ancient raindrops and that of those that fall 
today may therefore reflect a change in atmos- 
pheric density. Of course, the imprint generated 
by a raindrop falling at a given velocity depends
both on the size of the drop and on the nature 
of the substrate onto which it falls. To constrain 
these variables, the authors observed the size 
distributions of naturally occurring raindrops, 
and coupled this information with data from 
experiments in which they let water droplets 
fall onto volcanic ash — mimicking the condi- 
tions in which the fossil raindrops formed. 
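
The link between air density and fall speed can be sketched with a simple force balance. The snippet below is only an illustration, not the authors' method: it assumes a rigid spherical drop, quadratic drag and a constant drag coefficient of 0.5 (real raindrops flatten as they fall), and the function name and default values are hypothetical. Under those assumptions the terminal velocity scales with the inverse square root of the air density, so doubling the density slows a given drop by a factor of about 0.71.

```python
import math


def terminal_velocity(diameter_m, air_density, drop_density=1000.0,
                      drag_coefficient=0.5, g=9.81):
    """Terminal velocity (m/s) of a rigid sphere under quadratic drag.

    Assumed balance: m * g = 0.5 * air_density * Cd * area * v**2
    """
    volume = math.pi * diameter_m ** 3 / 6.0   # sphere volume
    area = math.pi * diameter_m ** 2 / 4.0     # cross-sectional area
    mass = drop_density * volume
    return math.sqrt(2.0 * mass * g / (air_density * drag_coefficient * area))


# Illustrative values: a 4-mm drop in air of 1.2 kg/m^3 and in air twice as dense.
modern = terminal_velocity(4e-3, 1.2)
denser = terminal_velocity(4e-3, 2.4)
ratio = denser / modern  # equals 1/sqrt(2) under these assumptions

print(f"modern air: {modern:.2f} m/s, doubled density: {denser:.2f} m/s")
print(f"velocity ratio: {ratio:.3f}")
```

Because the inferred velocity also depends on drop size and on the substrate, a sketch like this only shows the direction and rough strength of the density effect.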

Som et al. conclude that the atmospheric 
density 2.7 billion years ago was probably 
50 to 105% of that today. This finding imme- 
diately calls into question solutions to the faint 
young Sun paradox that invoke elevated con- 
centrations of greenhouse gases, unless small 
increases of greenhouse-gas concentration 
were able to exert a large warming effect. It
is also unlikely that higher concentrations of
greenhouse-enhancing nitrogen could have
resolved the paradox, because concentrations of
twice or more the present atmospheric abun-
dance would have been required to provide
sufficient warming in the presence of a modest
increase in carbon dioxide. Under such condi-
tions, the atmospheric density would have been
greater than that predicted by the authors. It
therefore seems that elevated concentrations
of highly effective greenhouse gases, such as
methane, ethane and/or carbonyl sulphide,
may be required to explain the paradox,
possibly in combination with moderately
higher concentrations of less-effective green-
house gases such as carbon dioxide. A lower
planetary albedo, caused by the reduction or
absence of continental ice sheets, could also
have contributed to warming.

Figure 1 | Solid evidence. Som et al. have
analysed the fossilized imprints of raindrops, such
as those shown here, to determine the atmospheric
density 2.7 billion years ago. Rule, 5 cm.

Although raindrop size distributions associ- 
ated with typical storms are well known, it is 
possible — albeit unlikely — that the ancient 
raindrops responsible for the fossilized 
imprints were unusually large. Small errors in 
the inferred size of the raindrops would result 
in significant errors in the atmospheric pres- 
sure predicted by Som and colleagues’ method’. 
The accuracy of the method is further limited 
by lack of information about factors (such as 
moisture content) that would have affected 
the cohesiveness of the ash in which the fos- 
sil imprints were made; the cohesiveness can 
affect the morphology of impact craters. The 
atmospheric density 2.7 billion years ago could 
therefore have been more than twice that of the 
modern atmosphere if the circumstances under 
which the fossil imprints formed were unusual. 

It is to be hoped that Som and colleagues’ 
work will stimulate further studies of fossil 
raindrop imprints, including perhaps those 
originally observed by Lyell. In particular, it 
will be interesting to see whether coherent 
temporal trends in atmospheric pressure can 
be inferred from imprints in deposits of vary- 
ing ages. With increasing recognition and anal- 
ysis of such features in the geological record, 
it may be possible to establish a chronologi- 
cal record of atmospheric pressure on Earth 
throughout the past 3.5 billion years. Such a 
record would shed light on otherwise poorly 
constrained aspects of climate change deep in 
Earth's history.
50 Years Ago


Charles Darwin's Orchid Bank at 
Downe in Kent has recently been 
acquired as a nature reserve by the 
Kent Naturalists’ Trust. It is a site
of great scientific interest which
shows a wide range of vegetation
types ... and, as it is only sixteen
miles from the centre of London,
the need for conservation is urgent
and considerable. It is also of unique
historical importance, for Darwin
is known to have carried out field
studies there from his nearby home, 
Down House, where he lived for forty 
years ... Eleven of the thirteen species 
of orchid mentioned by Darwin as 
occurring within a mile of his house 
can still be found in the reserve. 
From Nature 21 April 1962 


100 Years Ago 


The Mind of Primitive Man. By Franz 
Boas — There is a popular fallacy 
that racial antipathy is based on 
physiological foundations. But in so 
far as such antipathy is real, there is 
nothing physiological in its causation 
... The author’s discussion and 
explanation of the causes and results 
of variation within a race ... supply 
the most convincing theory that
has yet appeared ... The ordinary
view of the mental deficiencies of
the “inferior races” is remorselessly
criticised. The lowest savage does
possess self-control. He is not
improvident, but rather optimistic.
He can concentrate his mind. He
possesses originality. Savages who do
not count beyond three or ten easily
adapt their language and intellect
to civilised methods of reckoning
... The point is that these civilised
methods are not needed in the
primitive state, where each man on
a war-expedition is known by name,
though the number of the troop 
may not be reckoned. Both in mind 
and in body there is little to choose 
between the ordinary barbarian and 
the civilised man. 

From Nature 18 April 1912 
Tunnelling across
a nanowire


The observation of a phenomenon known as coherent quantum phase slip, 
across a nanowire in a superconducting system, paves the way for applications 
in quantum computing and metrology. SEE LETTER P.355 


ALEXEY BEZRYADIN 


Quantum mechanics is the most accurate
theory of modern physics. It was originally
formulated in 1925 to describe
microscopic particles such as electrons and 
atoms, but whether the theory is applicable to 
the macroscopic world of everyday objects has 
remained unclear. On page 355 of this issue, 
Astafiev et al.* demonstrate that quantum 
theory can describe the tunnelling of magnetic 
flux across a narrow segment of a supercon- 
ducting loop. Taken as a whole, this device 
represents a macroscopic and complex system, 
involving — at the very least — probably many 
thousands of electrons. 

Astafiev and colleagues” have created a type 
of superconducting quantum bit (qubit) that 
was first proposed by Mooij and Harmans’. 
The device's main element is a segment of 
a homogeneous nanowire within a closed 
superconducting loop (Fig. 1). The system 
operates by allowing quantum tunnelling of 
magnetic flux, into and out of the loop, across 
the nanowire. Such tunnelling preserves a form 
of quantum memory known as phase coher- 
ence and is called coherent quantum phase 
slip (CQPS). The device obeys the physics 
of macroscopic quantum systems and has 
implications for fundamental metrology and 
information technology. 

The principle of quantum tunnelling* posits 
that any microscopic particle has some chance 
of penetrating any wall, no matter how high the 
wall’s associated energy barrier. But does this 
effect apply to macroscopic objects? On the 
basis of one modern interpretation of quan- 
tum mechanics’, quantum effects are applica- 
ble to any large system, even the Universe. But 
when applied to macroscopic objects, such a
global theory would lead to paradoxical predictions,
for example Schrödinger’s cat,
which, according to the principle of quantum 
superposition, can be alive and dead at the 
same time. 

However, another interpretation exists in 
which large Schrödinger’s cats do not occur
because of a phenomenon known as sponta- 
neous wavefunction collapse”. Such a collapse 
destroys phase coherence and occurs sponta- 
neously at a low rate. Therefore, experimental 
physicists are avidly testing the applicability of 
quantum theory to the macroscopic world’. 
To do this, they often start with tiny electronic 
devices, which display coherent quantum 
tunnelling between macroscopically distinct 
states. Such small-scale but macroscopic
quantum systems may be called artificial 
atoms, given the discrete nature of their energy 
states, and can be used as qubits, the building 
blocks of quantum computation. 

The Mooij-Harmans qubit, which Astafiev 
et al. now demonstrate’, can exist either in a 
state in which the superconducting current 
in the loop flows clockwise or in a state that 
has an anticlockwise current. It can also be in a
symmetric quantum superposition of these two 
states, which is described mathematically by the 
sum of the clockwise and anticlockwise states. 
What's more, the symmetric superposition 
state has a twin state of higher energy, called 
the antisymmetric state. This state is obtained 
by subtracting the anticlockwise state from the 
clockwise state. In their study, the authors pre- 
pared the qubit in the symmetric superposition 
state and, by shining microwave photons on 
the qubit, were able to make it switch between 
the two superposition states. To detect these 
states, they connected the qubit to the centre 
conductor of a microwave resonator system, 
and showed that the two states slow down 
electromagnetic waves differently. 
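
As a compact restatement of the paragraph above, the two superposition states can be written as follows. The labels cw and acw for the clockwise and anticlockwise current states, the symbols S and A, and the normalization factor are notational assumptions added here for illustration; the text specifies only that the states are the sum and the difference of the two current states and that the antisymmetric state has the higher energy.

```latex
\[
|S\rangle = \frac{1}{\sqrt{2}}\left(|\mathrm{cw}\rangle + |\mathrm{acw}\rangle\right),
\qquad
|A\rangle = \frac{1}{\sqrt{2}}\left(|\mathrm{cw}\rangle - |\mathrm{acw}\rangle\right),
\qquad
E_A > E_S .
\]
```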

In a Mooij-Harmans qubit, the transition 
from the clockwise state to the anticlockwise 
state is accompanied by tunnelling of mag- 
netic flux, or quantum phase slip, across the 
nanowire. If the qubit’s wire is continuous, the 
magnetic flux enters the loop by creating what 
is known as a phase-slip core. Such a core is 
similar to the normal core of a vortex of mag-
netic flux in a superconductor. But CQPS, in
which the quantum history of the tunnelling
(the coherence) is preserved, is difficult to
attain because the core dissipates energy, erases
the quantum memory, causes decoherence and
suppresses the tunnelling rate exponentially.
As a result, only incoherent quantum phase
slips had thus far been detected in homogene-
ous nanowires.

Figure 1 | The Mooij-Harmans qubit. The
two-state quantum system, or qubit, demonstrated
by Astafiev et al. consists of a nanowire segment
in a superconducting loop made of indium oxide
and laid on a substrate. The strongly disordered
electronic structure of indium oxide allows
coherent quantum tunnelling of magnetic flux,
known as coherent quantum phase slip, to occur
across the nanowire. (Figure not drawn to scale.)

Previous observations of CQPS have 
involved the use of superconducting qubits 
made of nanowires interrupted by thin insu- 
lating junctions. Such junctions create tun- 
nel barriers for the superconducting current, 
but help the magnetic flux to cross the wire 
and enter the loop because superconductors 
expel the magnetic field from their interior 
whereas insulators do not. Also, CQPS in these 
interrupted wires has no core. Therefore, the 
tunnelling rate is not strongly influenced by 
dissipation effects. Astafiev and colleagues’
superconducting qubit is the first of its kind
in which CQPS is observed across an uninter- 
rupted superconducting nanowire. 

The observation of CQPS in a continuous 
nanowire was possible because the supercon- 
ducting material from which the wire is made 
— indium oxide — has a strongly disordered 
electronic structure. Because of its oxygen 
atoms’ tendency to bind or localize electrons,
indium oxide is so disordered that it is not even 
described by the standard Bardeen-Cooper- 
Schrieffer (BCS) theory of superconductivity. 
In fact, the material is almost an insulator. In 
BCS-type superconductors, all free electrons 
participate in a single, collective bound state 


known as the BCS condensate. Such a state 
is like a huge superconducting ‘molecule’ of 
electrons, which moves without friction and 
so leads to superconductivity. In indium oxide, 
the BCS condensate is not one but many such 
molecules. The condensate is localized, mean- 
ing that it fractures into a system of BCS ‘lakes’ 
or ‘droplets’, which interact only very weakly
with one another. 

Astafiev and colleagues’ work suggests that 
the presence of such lakes is crucial to the suc- 
cessful operation of the Mooij-Harmans qubit. 
In a superconducting wire that contains local-
ized BCS-condensate lakes, the phase-slip core 
can pass between the lakes and thereby avoid 
any strong dissipation. Hence, instead of the 
conventional approach of making an insulat- 
ing junction in the nanowire to observe CQPS, 
the researchers used a material (indium oxide) 
that is almost in an insulating state. This is a 
neat solution because, on the one hand, the 
wire has some phase-slip cores, but, on the 
other hand, dissipation effects are reduced. 

The present study paves the way for testing 
theoretical predictions for nanowires support- 
ing CQPS and for applications in fields such 
as quantum computing and quantum metrology.
It has been proposed that CQPS could
be used to produce a fundamental standard of 
current. One issue that remains to be investi- 
gated is the possibility — or perhaps impossi- 
bility — of attaining CQPS in nanowires that 
are less disordered than indium oxide, and, in 
particular, in pure, single-crystal nanowires in
which BCS theory is fully applicable and a well-
developed phase-slip core exists. Such systems
should be compared with those that support
coreless CQPS, to quantify the decoherence
introduced by the phase-slip core.
No lazing on sunny
afternoons


In the laboratory, fruitflies rely on an internal clock to alternate activity with 
a midday nap and night-time sleep. Surprisingly, when outdoors, they follow 
temperature rather than the clock, and skip siestas. SEE LETTER P.371 


FRANCOIS ROUYER 


Circadian clocks are biological mechanisms
that allow living organisms to
adapt their physiology and behaviour
to day-night cycles. In particular, a neuronal 
circadian clock controls rhythms of rest and 
activity in many animal species, including the 
fruitfly Drosophila melanogaster. In laboratory 
conditions, this insect shows a bimodal activity 
profile, with maximal activity concentrated in 
the morning and evening. However, the artifi- 
cial light-dark cycles used in lab experiments 
only roughly mimic the complex envir- 
onmental changes associated with alternating 
day and night. Vanin et al.¹ report on page 371
of this issue that fruitflies indeed behave very
differently when exposed to more natural con-
ditions. Moreover, the authors’ results challenge
the widely held idea that light and the known 
elements of the circadian clock are the main 
players that shape rest-activity rhythms. 
Fruitflies show 24-hour sleep-wake rhythms 
that persist for weeks in laboratory conditions 
at constant room temperature, even in constant 
darkness. The insects’ circadian clock relies on 
a feedback loop that involves four proteins — 
CLOCK, CYCLE, PERIOD and TIMELESS 
— that bind to specific regions of DNA and 
control the expression of many genes². Mutant
fruitflies devoid of any one of these four proteins
lack circadian rhythms. Obviously, though, 
the constant conditions used in the lab are not 
suited to understanding the adaptive role of the 
clock under natural day-night cycles.

Why do animals (and humans) 
need an internal clock in a cycling 
environment? Textbooks tell us that 
a clock allows organisms to anticipate
environmental changes and therefore 
to prepare for such changes. And 
anticipating is exactly what fruit- 
flies do in the laboratory (Fig. 1a).
In light-dark conditions (12 hours 
of light and 12 hours of darkness, 
at constant room temperature), the 
insects show little activity at night 
and wake up a few hours before 
the lights are turned on. After their 
morning activity, they take an after- 
noon nap and then become progres- 
sively more active until the moment 
when the lights are switched off and 
sleep time starts. Such bimodal activ- 
ity profiles are thought to restrict the 
fruitflies’ behaviour in the wild to the 
times of the day (morning and even- 
ing) that have the most favourable 
temperature and humidity condi- 
tions. By contrast, clock-less mutant 
fruitflies do not anticipate labora- 
tory light-dark cycles and show only 
‘startle’ responses to light transitions, 
which demonstrates that a clock is 
required for these anticipatory activities. 

To understand how the circadian clock 
affects the insects’ behaviour in more natu-
ral conditions, Vanin et al.¹ placed the flies,
together with activity-recording devices, out- 
side the lab. The devices were shielded from 
direct sunlight but subject to natural varia- 
tions of light and temperature. The authors 
did these experiments from spring to autumn 
in two geographical locations, Leicester (UK) 
and Treviso (Italy), and used clock-defective 
mutants and normal fruitflies. 

These outdoor experiments brought big 
surprises. The researchers observed that nor- 
mal fruitflies, as well as most mutants (includ- 
ing clock-less insects that lacked PERIOD or 
TIMELESS), displayed a prominent afternoon 
activity peak in addition to the morning and 
evening peaks (Fig. 1b). The afternoon activ- 
ity was temperature-dependent: for each of the 
fruitfly strains that showed afternoon activity 
when the mean day temperature was 27°C, 
only 50% of the insects did so at 20°C. Some 
afternoon activity had been reported previously³⁻⁵
when temperature cycles were used in
lab conditions. When the authors tested per^S
mutant fruitflies, which express a defective 
PERIOD protein, leading to an earlier timing of 
evening activity, they found that the afternoon 
peak also took place earlier. 

Vanin and colleagues’ results¹ indicate that
the afternoon activity is modulated by the
circadian clock, and seem to exclude a simple
heat-induced response. They also suggest that
the insects are highly active in the middle of the
day under natural conditions, a conclusion that
contrasts with results of previous lab experi-
ments⁶,⁷, which showed that fruitflies prefer
low to high light intensity and tend to show
increased activity when simulated moonlight
is applied at night. However, in the authors’
outdoor experiments¹, the amount of activity
restricted to the morning and evening twilights
was always less than 25% of the total activity,
and the insects did not tend to be more noc-
turnal around the time of a full moon. These
results suggest that, in the wild, fruitflies are
much more diurnal than crepuscular, in
contrast to what lab experiments tell us.

Figure 1 | Rest-activity rhythms in fruitflies. a, In laboratory-induced
dark-light cycles (12 hours dark, 12 hours light and a constant temperature
of 25°C), normal fruitflies show morning and evening anticipatory
activities, whereas clock-less mutant insects do not. b, When Vanin
et al.¹ studied these processes outdoors (in natural light and temperature
conditions), they found that normal fruitflies and clock-less mutants show
similar morning and evening activities, as well as a prominent afternoon
activity peak. The authors’ results suggest that temperature is a key factor
in shaping the flies’ activity under outdoor conditions.

Vanin et al. then carefully analysed the tim- 
ing of morning and evening activities, which 
are driven apart when day length increases, 
and observed that the onset of morning activ- 
ity was similar in normal and clock-defective 
insects. However, for all the fruitfly strains 
tested, except for clock-less mutants lacking 
TIMELESS, the higher the mean temperature, 
the earlier was the morning wake-up. In other 
words, temperature — and not the clock — 
seems to tell the insects when to become active. 
When the authors used a cover to prevent the 
fruitflies from seeing twilight, the temperature- 
dependent onset of the morning activity was 
delayed. Therefore, twilight shifts the tem- 
perature-dependent start of morning activity 
to an earlier time. Why TIMELESS-lacking 
insects do not similarly respond to temperature 
remains unknown, but it has been shown⁸ that
these mutants show altered diapause, a seasonal 
overwintering response that also requires a 
coordinated reaction to light and temperature. 
What about evening activity? The authors 
observed that, on cold days (mean
temperature below 20°C), the flies’ 
evening activity started in the middle 
of the day, between the time of high- 
est light intensity and that of highest 
temperature about 2 hours later. On 
hot days, by contrast, they found that, 
the higher the temperature, the later 
was the onset of evening activity. Tem- 
perature, therefore, sets the phase of 
evening activity. Of note, evening 
onset was earlier in per^S mutants,
which indicates that — in contrast 
to the onset of morning activity — 
the beginning of evening activity is 
controlled by the circadian clock. 

The outdoor experiments per-
formed by Vanin et al.¹ shake up our
assumptions about the timing of 
rest-activity rhythms in the fruitfly. 
The presence or absence of a circa- 
dian clock seems to have less effect 
on the insects’ activity than expected, 
although the clock does regulate after- 
noon and evening activities. As light 
and temperature can show abrupt and 
unexpected variations during the day, 
a key function of the circadian clock 
might be to act as a buffering system 
to keep day activity stable. 

Ironically, although laboratory studies⁹,¹⁰
indicate that the same circadian-clock neurons 
can drive both morning activity and sus- 
tained circadian rhythms, the clock does not 
seem to influence morning activity. These 
neurons might therefore have roles in addi- 
tion to triggering morning activity. It will be 
important to see whether other circadian- 
modulated processes in the fruitfly, such as 
the emergence of young adults from pupae 
(a life stage between larva and adult), obey 
the rules revealed by Vanin and colleagues’ 
experiments. In any case, the present work is a 
starting point for bringing the natural envir- 
onment into the lab and thus for beginning 
to tackle the cellular and molecular basis of 
real-life circadian behaviour.


Francois Rouyer is at the Institut de 
Neurobiologie Alfred Fessard, CNRS UPR 
3294, 91198 Gif-sur-Yvette, France. 
e-mail: francois.rouyer@inaf.cnrs-gif.fr 


1. Vanin, S. et al. Nature 484, 371-375 (2012).
2. Hardin, P. E. Adv. Genet. 74, 141-173 (2011).
3. Tomioka, K., Sakamoto, M., Harui, Y., Matsumoto, N. & Matsumoto, A. J. Insect Physiol. 44, 587-596 (1998).
4. Busza, A., Murad, A. & Emery, P. J. Neurosci. 27, 10722-10733 (2007).
5. Glaser, F. T. & Stanewsky, R. Curr. Biol. 15, 1352-1363 (2005).
6. Rieger, D. et al. J. Biol. Rhythms 22, 387-399 (2007).
7. Bachleitner, W., Kempinger, L., Wülbeck, C., Rieger, D. & Helfrich-Förster, C. Proc. Natl Acad. Sci. USA 104, 3538-3543 (2007).
8. Tauber, E. et al. Science 316, 1895-1898 (2007).
9. Stoleru, D., Peng, Y., Agosto, J. & Rosbash, M. Nature 431, 862-868 (2004).
10. Grima, B., Chélot, E., Xia, R. & Rouyer, F. Nature 431, 869-873 (2004).




REVIEW 


doi:10.1038/nature11032 


Deformation cycles of subduction 
earthquakes in a viscoelastic Earth 


Kelin Wang, Yan Hu & Jiangheng He


Subduction zones produce the largest earthquakes. Over the past two decades, space geodesy has revolutionized our 
view of crustal deformation between consecutive earthquakes. The short time span of modern measurements 
necessitates comparative studies of subduction zones that are at different stages of the deformation cycle. Piecing 
together geodetic ‘snapshots’ from different subduction zones leads to a unifying picture in which the deformation is 
controlled by both the short-term (years) and long-term (decades and centuries) viscous behaviour of the mantle. 
Traditional views based on elastic models, such as coseismic deformation being a mirror image of interseismic
deformation, are being thoroughly revised.


A subduction fault may stay ‘locked’ for centuries and then
suddenly slip to cause an earthquake. Coseismic slip of tens of
metres can cause devastating shaking and tsunami, as most
recently demonstrated by the moment magnitude ($M_w$) = 9 Tohoku,
Japan, earthquake of 11 March 2011. The rupture and the subsequent
evolution of stress and strain leading to the next earthquake is loosely
called a subduction earthquake cycle (SEC). When the theory of plate
tectonics had just gained wide acceptance, the role of mantle creep in SEC
was postulated to reconcile the overall steady motion of tectonic plates
and the jerky deformation at their boundaries due to great earthquakes. In 
this early model, SEC stress fluctuations propagate as one-dimensional 
diffusion waves through an elastic plate overlying a viscous asthenosphere 
(Fig. 1a). Its prediction that the strongest elastic deformation should be 
found near plate boundaries proved correct and provided guidance for 
geodetic studies of SEC in the ensuing decades. Maxwell viscoelasticity 
was first applied to earthquake models in the 1970s (ref. 3), in recognition 
that the deep Earth can elastically transmit stress yet behaves like a viscous 
fluid that hosts mantle convection and causes delayed response to the
removal of surface ice load (see review in ref. 4). Two-dimensional
modelling efforts (Fig. 1b) in the 1980s (see review in ref. 5), especially
those to explain century-long deformation records of the Nankai subduction
zone, southwest Japan, established that three primary processes
took place after each great earthquake (Fig. 2). These are: (1) continuing
slip of the fault (now called ‘afterslip’), most evidently downdip of the
rupture zone, (2) viscoelastic relaxation of the earthquake-induced stress,
and (3) relocking of the subduction fault. These modelling efforts also led
to the view that asthenosphere viscosity at subduction zones is about
$10^{19}$ Pa s, one to two orders of magnitude lower than the global average.
However, the two-dimensional perspective failed to account for the fact
that the duration of postseismic relaxation scales with rupture length in
the third dimension. The consequent lack of universal success raised
doubts about the importance of viscous deformation in SEC.

The launch of the Global Positioning System (GPS) revolutionized
crustal deformation measurements. Since the early 1990s, GPS measurements,
either in campaign style or as continuous monitoring, have
delineated patterns of co-, post- and interseismic deformation for many
subduction zones with unprecedented clarity. Developments in satellite
radar interferometry and satellite gravimetry further improved the situation.
In the meantime, progress in computing technology has enabled
numerical viscoelastic SEC models that more accurately represent
three-dimensional Earth structure and rupture geometry (Fig. 1c).
However, the vast majority of the SEC models developed in this new
era, especially those focused on fault locking or fault friction, assume a
purely elastic Earth.

Figure 1 | Representative SEC models and development of observation
technology. a, One-dimensional stress diffusion model of 1973 (ref. 2) based
on ref. 1, a predictive model guiding future observations. b, Two-dimensional
Maxwell viscoelastic model of 1984 (ref. 6). Observations were based mainly on
terrestrial geodesy, especially repeat levelling surveys. c, Three-dimensional
Burgers viscoelastic model, used in this review. Today, space geodesy, especially
GPS (satellites and antennas shown), is the most common means of observing
contemporary crustal deformation.

Figure 2 | Three primary processes after a subduction earthquake.
Earthquake cycle = rupture + (1) + (2) + (3). (1)
Aseismic afterslip occurs mostly around the rupture zone, (2) the coseismically
stressed mantle undergoes viscoelastic relaxation, and (3) the fault is relocked.
Arrows at the top show the sense of horizontal motion of Earth’s surface, relative
to distant parts of the upper plate, caused by each of these three processes.

High-quality modern geodetic observations have been widely avail- 
able for only one to two decades. For each subduction zone, they provide 
merely a ‘snapshot’ of its SEC evolution. The main reason for the popu- 
larity of the elastic model is the convenience it offers in inferring the 
apparent state of fault locking from each snapshot. However, it has 
become evident that these snapshots collectively reflect a common 
deformation cycle that includes the above-mentioned three primary 
processes (Fig. 2). Since the 1980s, the most important progress includes 
the stronger evidence for very low viscosities of the mantle wedge, the
recognition that postseismic viscoelastic relaxation consists of both
transient and steady-state phases, as well as much improved knowledge
of afterslip. In this review, we also emphasize that the characteristic time
of the relaxation depends strongly on earthquake size.


Observing subduction earthquake cycles


When GPS first became available, it only confirmed what had already
been inferred from sparse terrestrial geodetic measurements: that many
subduction faults are currently locked and accumulating strain energy
for future earthquakes. An example is the Cascadia subduction zone,
where an $M_w \approx 9$ subduction earthquake occurred in 1700, and GPS
sites are now moving landward with respect to the remote areas of the
upper plate (Fig. 3c). Soon, intriguing GPS velocities were reported from
Chile and Alaska, where an $M_w$ = 9.5 and an $M_w$ = 9.2 earthquake
occurred in 1960 and 1964, respectively. These data show coastal sites to
be moving landward as seen at other locked subduction zones, but
inland sites some 200-400 km from the trench to be moving seaward
(Fig. 3b). In 2004, a devastating $M_w$ = 9.2 earthquake ruptured the
Sumatra margin. Most strikingly, GPS sites in a very large area of the
upper plate have been moving seaward ever since, in the same direction
as their coseismic motion (Fig. 3a). In March 2011, the $M_w$ = 9
Tohoku earthquake and its tsunami devastated northeastern Japan.
Similarly to Sumatra, terrestrial GPS sites in northeastern Japan have
been moving seaward since the earthquake (http://www.gsi.go.jp/
chibankansi/chikakukansi40005.html; in Japanese).

A pattern of SEC evolution for giant earthquakes thus emerged. It
features a progressive reversal of motion direction away from the rupture
zone: wholesale seaward motion, followed by opposing motion of the
coastal and inland areas, and eventual wholesale landward motion. This
evolution, in a more convincing way than did the Nankai levelling data,
invalidates a popular belief that interseismic deformation is a subdued
mirror image of coseismic deformation. In retrospect, we can see that the
one-dimensional stress-diffusion model (Fig. 1a) four decades ago
would have predicted the opposing motion currently seen at Chile and
Alaska, but it took space geodesy to rediscover it.

Figure 3 | GPS- (red) and model-predicted (blue) surface velocities for three
subduction zones that are at different stages of the earthquake cycle. a, At
Sumatra, one year after the $M_w$ = 9.2 earthquake of 2004 (refs 20 and 21)
(epicentre shown by star), all sites move seaward. Shown are ~1-year average
GPS velocities. More recent data show the same pattern. Coseismic fault slip
(contoured in metres) is based on ref. 56. Longer (~3-year) time series from
the three labelled far-field sites (BNKK, CPN, PHKT) helped constrain
afterslip and transient rheology (ref. 48). b, At Chile, four decades after the
$M_w$ = 9.5 earthquake of 1960, coastal and inland sites show opposing motion.
Coseismic slip is from ref. 14. For sources of GPS data, see ref. 17. The
northernmost areas show wholesale landward motion before the 2010 $M_w$ = 8.8
Maule earthquake. c, At Cascadia, three centuries after the $M_w \approx 9$ earthquake
of 1700, all sites move landward. The model is an updated version of ref. 8. A
more comprehensive GPS compilation shows a similar deformation pattern.


Viscoelastic mantle relaxation

In laboratory experiments, rock creep exhibits an initial transient phase
of rapid change and a subsequent steady-state phase (see Box 1).
Traditionally, the study of rock creep is focused on the steady-state
behaviour that is appropriate for long-term processes such as mountain
building and mantle convection. The viscous component of Maxwell
viscoelasticity corresponds to the steady-state creep. The Maxwell
rheology is found to be adequate in modelling century- to millennium-scale
glacial isostatic adjustment and decadal-scale SEC deformation
in Chile or Alaska.


BOX 1
Steady-state and transient rheology

Under temperature ($T$) and pressure ($P$) conditions appropriate for the
lower crust and mantle, if a rock specimen is loaded with a constant
stress $\sigma$, it undergoes an initial elastic deformation $\varepsilon_e$, then transient
creep of rapidly changing strain rate $\dot{\varepsilon}_t$, and eventually steady-state
creep of constant strain rate $\dot{\varepsilon}_s$ (see Box 1 Fig. 1). Different loading
stresses result in different steady-state strain rates. These pairs of $\sigma$
and $\dot{\varepsilon}_s$ define the commonly seen steady-state flow law

$\dot{\varepsilon}_s = D\sigma^n \exp[-(Q+PV)/RT]$,

where the parameter $D$ depends on the
rock composition, grain size, and fluid content, and $n$, $Q$, $V$ and $R$ are all
constants.

The value of the stress exponent $n$ depends on the microscopic
deformation mechanism. For example, creep due to mass
diffusion leads to $n = 1$, but creep due to migration of dislocations in
crystalline lattice structure leads to $n \approx 3$. If transient creep is ignored,
the viscoelastic rock deformation can be described using the
Newtonian ($n = 1$) or non-Newtonian ($n > 1$) Maxwell fluid, which can
be envisioned as an elastic element with shear modulus (rigidity) $\mu_M$
and bulk modulus $K_M$ in series with this steady-state viscous element.
For $n = 1$, the viscosity $\eta_M$ is proportional to $D^{-1} \exp[(Q+PV)/RT]$,
with the exact form depending on what stress component and strain-rate
component are used for $\sigma$ and $\dot{\varepsilon}_s$ and how the flow law is
expressed in three dimensions. Most mantle convection, glacial
isostatic adjustment, and SEC models assume $n = 1$, although some
include strong temperature dependence and/or spatial variation. This
viscosity is a simplified representation of a more complex system that
may actually exhibit nonlinearity. Its correspondence with the
effective viscosity (a scaled ratio of stress to strain rate) of nonlinear
flow cannot be quantified if the stress changes rapidly with time as in
postseismic deformation.

A widely used parameterization of the transient creep is the Kelvin
(or Kelvin-Voigt) solid, which can be envisioned as an elastic element
of shear modulus $\mu_K$ in parallel with a viscous element of viscosity $\eta_K$. If
the transient creep is considered, the simplest model to depict the
viscoelastic rock deformation is the bi-viscous Burgers rheology, that
is, a Kelvin solid in series with a Newtonian Maxwell fluid (see Box 1
Fig. 2). The spatial variation and stress-dependence of $\eta_K$ are poorly
known, although the viscosity ratio $\beta = \eta_M/\eta_K$ is sometimes assumed
to be a constant. More complex creep behaviour can be described by
including additional Kelvin and/or Maxwell elements. There is a recent
attempt to modify the Burgers rheology to include non-Newtonian
steady-state creep, but based on an assumption that a stress-independent
$\beta$ can still be defined.

Box 1 Fig. 1 | Strain evolution of a rock specimen under constant stress.

Box 1 Fig. 2 | Composition of the bi-viscous Burgers rheology.

The widely used elastic model and a steady-state mantle wedge
viscosity of about $10^{19}$ Pa s in most Maxwell SEC models cannot be
reconciled. What may support the elastic model is the enduring notion
since 1935 (ref. 25) that upper-mantle viscosity is about $10^{21}$ Pa s, a value
that would render a Maxwell mantle mostly elastic over an SEC of a few
centuries. The upper-mantle viscosity structure has been much refined,
but a global average of $10^{20}$-$10^{21}$ Pa s is still valid. However, there is
mounting evidence for lower viscosity at subduction zones, including
laboratory measurements of relevant rocks, inference of small-scale
convection in the backarc based on heat flow observations, constraints
from topography and geoid anomalies, glacial isostatic adjustment
analyses, as well as the inferences from postseismic deformation. The
steady-state viscosity used to explain satellite gravity observations following
the 2004 Sumatra earthquake is $10^{18}$-$10^{19}$ Pa s (refs 32 and 33).
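
As a rough check on why the classic upper-mantle viscosity makes the mantle look nearly elastic over one cycle, the Maxwell relaxation time (viscosity divided by rigidity) can be compared for the two viscosity values discussed above. This is only a sketch: the rigidity of 64 GPa is borrowed from the value this review adopts later for the mantle wedge, and the notation $\tau(\eta)$ is introduced here for illustration.

```latex
\begin{align*}
\tau(\eta = 10^{21}\ \mathrm{Pa\,s}) &= \frac{10^{21}\ \mathrm{Pa\,s}}{64\times 10^{9}\ \mathrm{Pa}}
  \approx 1.6\times 10^{10}\ \mathrm{s} \approx 500\ \mathrm{yr},\\
\tau(\eta = 10^{19}\ \mathrm{Pa\,s}) &= \frac{10^{19}\ \mathrm{Pa\,s}}{64\times 10^{9}\ \mathrm{Pa}}
  \approx 1.6\times 10^{8}\ \mathrm{s} \approx 5\ \mathrm{yr}.
\end{align*}
```

With the higher value the relaxation time is as long as, or longer than, an interseismic period of a few centuries, so little viscous relaxation accumulates within one cycle. With the lower value relaxation is well under way within years to decades.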

Continuous GPS monitoring suggests that immediate postseismic 
deformation and its change with time are much faster than the 
longer-term behaviour. From rock physics and experiments, it is logical 
to infer that the rapid short-term deformation reflects not only afterslip 
but also transient rock creep. In recent postseismic deformation
models for Sumatra and the general SEC model in this review
(Fig. 1c), a bi-viscous Burgers rheology is employed. The viscoelastic 
relaxation of the earthquake-induced stress thus has two timescales, a 
shorter one for the transient viscosity and a longer one for the steady- 
state viscosity (see Box 1). 
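
The two timescales can be made concrete with the textbook one-dimensional creep response of a Burgers body (a Kelvin element in series with a Maxwell element) under constant stress. The sketch below is an idealized spring-and-dashpot calculation, not the three-dimensional finite-element model of this review. The function name, the 1 MPa stress step and the sampling times are assumptions chosen for illustration, and the rigidity and viscosity defaults are the values quoted later in the text.

```python
import math

SECONDS_PER_YEAR = 365.25 * 24 * 3600.0


def burgers_creep_strain(t_yr, stress_pa, mu_m=64e9, eta_m=1e19,
                         mu_k=64e9, eta_k=5e17):
    """Strain components of a 1-D Burgers body under constant stress.

    Returns (elastic, transient, steady):
      elastic   - instantaneous strain of the Maxwell spring
      transient - Kelvin-element strain, saturating on the time eta_k / mu_k
      steady    - Maxwell-dashpot strain, growing linearly with time
    """
    t = t_yr * SECONDS_PER_YEAR
    elastic = stress_pa / mu_m
    transient = (stress_pa / mu_k) * (1.0 - math.exp(-mu_k * t / eta_k))
    steady = stress_pa * t / eta_m
    return elastic, transient, steady


if __name__ == "__main__":
    stress = 1e6  # hypothetical 1 MPa stress step, for illustration only
    for t_yr in (0.0, 0.25, 1.0, 5.0, 50.0):
        e, tr, s = burgers_creep_strain(t_yr, stress)
        print(f"t = {t_yr:5.2f} yr  elastic = {e:.2e}  "
              f"transient = {tr:.2e}  steady = {s:.2e}")
```

In this idealization the transient term saturates within roughly a year, whereas the steady-state term only catches up with the elastic strain after about five years, which is the separation of timescales described above.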


Afterslip and relocking of the fault 


The ubiquitous presence of afterslip is well recognized. A subduction 
fault is a zone of finite thickness with evolving internal structure, and
its slip behaviour is affected by numerous factors including the roughness
of the subducting seafloor and amount of trench sediments. But
to the first order it is regarded as a frictional contact. Afterslip is thus
understood to be the velocity-strengthening behaviour of the fault areas
around the rupture zone. Deeper than about 70-80 km, where the
subduction interface is better described as a zone of localized viscous
shear, the distinction between afterslip (a fault process) and viscoelastic
relaxation (a mantle process) becomes increasingly obscure. 

A range of slip behaviour from stable sliding to unstable stick-slip can 
be explained using a rate-and-state friction law, but the application
of the friction theory to realistic fault geometry and Earth rheology is still
in its infancy. In most viscoelastic SEC models it is necessary to prescribe
fault motion. The most common way of incorporating fault locking is to
assign a backslip rate along the locked zone, assuming that steady
subduction and its possible consequence in causing permanent surface 
deformation have been subtracted. 

The characteristic timescale of afterslip, $T_A$, appears to be a few months
to a few years. In some cases, if the Earth is assumed to be purely elastic,
long-lasting afterslip is used to explain prolonged postseismic deformation.
Conversely, there are also cases in which short-term postseismic
deformation is explained using viscoelastic relaxation without resorting
to afterslip. The truth lies in between, and determining $T_A$ and the
importance of afterslip relative to viscoelastic relaxation in controlling
short-term postseismic deformation remains a challenge.

In the three primary processes following an earthquake (Fig. 2), fault
locking has the longest timescale ($T_L$), which is simply the length of the
interseismic period. Over the past two decades, it has become a common
mistake to equate ‘interseismic’ (a time concept) with ‘elastic deformation’
(a physical process). Even in recent three-dimensional viscoelastic
models, the effect of fault locking is often simulated using an elastic
Earth. Although the relaxation of the earthquake-induced stress is
listed as one primary process in Fig. 2, it cannot be overemphasized that
the other two primary processes also involve their own viscoelastic
relaxation. The stress built up by fault locking is partially relaxed
at the same time, regardless of how long after the earthquake and
whether the deformation pattern is still changing with time. Given the
same locked zone and at a very long time after an earthquake, elastic
shortening of the upper plate occurs in a broader zone if the mantle is
undergoing viscoelastic relaxation but in a narrower zone if the mantle is
purely elastic. For this reason, the maximum depth of locking is usually
over-predicted if the viscous effect is ignored or insufficiently accounted
for, such as in all the locking models for northeastern Japan published
before the Tohoku earthquake.


Piecing together an earthquake cycle 


Using a numerical SEC model, we demonstrate that the drastically dif- 
ferent deformation patterns at the three margins shown in Fig. 3 simply 
reflect different stages of a common evolution process. Although a site- 
specific three-dimensional model is needed to describe the structure and 
kinematics of each margin, the same rheological parameters are used for 
all the margins. In effect, there is only one physical model. The three 
margins differ from one another in many aspects, such as the age of the 
subducting plate and convergence rate. That a similar viscosity structure 
can be used to explain their first-order SEC is consistent with the obser- 
vation that the arc-backarc regions of most subduction zones share a 
similar thermal state.

This model is an improvement over previously published viscoelastic 
models for these margins mentioned herein. For example, the use of the 
finite-element method allows the incorporation of realistic fault geo- 
metry and rupture distribution for Sumatra, and both afterslip and 
transient rheology are included for all the three margins. Because the 
transient rheology still has a small influence even a few decades after the 
earthquake, the steady-state viscosity required to explain the Chile 
observations is slightly lower than in previous models. Other important
model features (Fig. 1c) include the spherical Earth geometry, which
affects far-field deformation, and the presence of an elastic slab, which 
strongly controls the flow pattern of the mantle material and hence its 
relaxation process. A continuous distribution of afterslip is assigned that 
complements coseismic slip in the strike direction, tapers in the updip 
and downdip directions, and decays with time. Technical details of the 
modelling and the method of prescribing afterslip by trial-and-error are 
explained in ref. 48. 

The relaxation time of a uniform Maxwell body of viscosity $\eta_M$ and
rigidity $\mu_M$ is $\tau_M = \eta_M/\mu_M$. For the subduction zone system, a mixture of
elastic and viscoelastic materials, we define an empirical steady-state
system-relaxation time $T_M = (M_0/M_0^0)\,\tau_M$, where $\tau_M$ is the Maxwell time
of the isoviscous mantle wedge, $M_0$ is the seismic moment (the product
of rigidity, rupture area, and mean slip) of the earthquake that causes the
stress perturbation, and $M_0^0$ is a reference moment. The relationship
between $M_0$ and $M_w$ (ref. 49) gives $\log(M_0/M_0^0) = 1.5(M_w - M_w^0)$,
where $M_w^0$ corresponds to $M_0^0$. The role of rupture length, which
two-dimensional models fail to account for, is reflected in this scaling factor.
Similarly, we define an empirical transient system-relaxation time
$T_K = (M_0/M_0^0)\,\tau_K$, where $\tau_K = \eta_K/\mu_K$ is the Kelvin time for the mantle
wedge of transient viscosity $\eta_K$ and rigidity $\mu_K$. We assume
$\mu_M = \mu_K = 64$ GPa, $\eta_M = 10^{19}$ Pa s, and $\eta_K = 5 \times 10^{17}$ Pa s. These
viscosity values are similar to those used in previous Sumatra models
that employed an analytical solution in a spherical viscoelastic Earth. For
$M_w^0 = 8.4$, these parameters give $T_K \approx 4$ yr and $T_M \approx 80$ yr for an
earthquake of $M_w = 9.2$. We also assume that the mantle wedge is less viscous
than the oceanic mantle by a factor of ten because of the presence of
fluids from the dehydrating slab. The higher viscosity of $10^{20}$ Pa s for
the oceanic mantle is similar to the global mantle average. To focus on
the first-order physics, we have intentionally ignored the spatial variations
of the viscosities through their dependence on temperature $T$,
pressure $P$ and fluid contents. Uncertainties and approximations in
the viscosity structure are responsible for the arbitrariness in the choice
of $M_0^0$ in scaling $T_M$ and $T_K$ with earthquake size.
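
The quoted system-relaxation times can be reproduced by hand from the stated parameters. The working below is a sketch of that arithmetic using the definitions in the preceding paragraph; the rounding and the conversion of 1 yr to about $3.16 \times 10^{7}$ s are the only additions.

```latex
\begin{align*}
\tau_M &= \frac{\eta_M}{\mu_M} = \frac{10^{19}\ \mathrm{Pa\,s}}{64\times 10^{9}\ \mathrm{Pa}}
        \approx 1.56\times 10^{8}\ \mathrm{s} \approx 4.95\ \mathrm{yr},\\
\tau_K &= \frac{\eta_K}{\mu_K} = \frac{5\times 10^{17}\ \mathrm{Pa\,s}}{64\times 10^{9}\ \mathrm{Pa}}
        \approx 7.8\times 10^{6}\ \mathrm{s} \approx 0.25\ \mathrm{yr},\\
\frac{M_0}{M_0^0} &= 10^{1.5\,(M_w - M_w^0)} = 10^{1.5\,(9.2 - 8.4)} = 10^{1.2} \approx 15.8,\\
T_M &= \frac{M_0}{M_0^0}\,\tau_M \approx 15.8 \times 4.95\ \mathrm{yr} \approx 80\ \mathrm{yr},\qquad
T_K = \frac{M_0}{M_0^0}\,\tau_K \approx 15.8 \times 0.25\ \mathrm{yr} \approx 4\ \mathrm{yr}.
\end{align*}
```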
In explaining geodetic observations, there is a trade-off between $T_A$,
$T_K$ and $T_M$, reflecting uncertainties in the parameter values. $\eta_M$ is better
constrained by the observed opposing motion in Chile but can vary by a
factor of two in the Sumatra and Cascadia models without significantly
affecting the fit to the GPS data. Up to three years of GPS data from
Sumatra require both afterslip and transient mantle rheology and help to
determine $T_A$ and $\eta_K$ (ref. 48). New observations following more recent
great earthquakes such as the $M_w$ = 8.8 Maule event in 2010, just north
of the 1960 Chile rupture (Fig. 3b), and the Tohoku event in 2011 will
surely provide better constraints in the near future.

The three primary processes (Fig. 2) begin immediately after the earthquake,
but they take turns to play dominant roles. At times comparable to
$T_A$ and $T_K$, the effects of Maxwell relaxation and fault locking are
overshadowed by those of the afterslip and transient (Kelvin) rheology. The
dominance of the afterslip and transient rheology causes rapid seaward
motion at Sumatra (Fig. 3a). The Sumatra results also explain the present
wholesale seaward motion in northeast Japan following the Tohoku
earthquake. At times comparable to $T_M$, the effects of both Maxwell
relaxation and locking become dominant, but the two processes give rise
to contrasting surface motions (Fig. 2). For Chile (Fig. 3b), sites far away
from the trench are still moving seaward because of Maxwell relaxation,
but coastal sites are already moving landward because of fault locking.
The Chile results also explain the opposing motion currently seen in
Alaska. Eventually, at times much longer than $T_M$, the earthquake-induced
stresses will be sufficiently relaxed, and the effect of fault locking
dominates the entire upper plate. This is the situation currently seen at
Cascadia (Fig. 3c). The Cascadia results also explain the wholesale landward
motion observed before the Tohoku earthquake.


Scaling with earthquake size 


The reason that $T_K$ and $T_M$ scale with the seismic moment is that the
timescale of relaxation depends on the initial stress perturbation. For
example, longer ruptures tend to excite longer-wavelength relaxation
modes, which are generally associated with correspondingly longer
relaxation times owing to the space-dispersive nature of the relaxation
process. Therefore, after a smaller earthquake the effect of fault locking
becomes dominant more quickly, such that the Sumatra-type seaward
motion and the Chile-type opposing motion are short-lived or even
absent. Failure to recognize this scaling was a fatal limitation of earlier
two-dimensional models and caused doubts about the necessity of
viscoelasticity. Here we illustrate the general role of earthquake size
(Fig. 4), using idealized models with the same rheology as discussed above.
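
To give a feel for how strongly the empirical system-relaxation times depend on earthquake size, the short script below tabulates them for the three idealized model magnitudes. It is only a sketch of the scaling rule stated earlier (reference magnitude 8.4, rigidity 64 GPa, steady-state and transient viscosities of 10^19 and 5 x 10^17 Pa s); the function and variable names are hypothetical, and it does not reproduce the numerical models behind Fig. 4.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600.0


def system_relaxation_times(mw, mw_ref=8.4, mu=64e9, eta_m=1e19, eta_k=5e17):
    """Return (T_K, T_M) in years for an earthquake of magnitude mw.

    Uses log10(M0 / M0_ref) = 1.5 * (mw - mw_ref) to scale the Kelvin
    time eta_k / mu and the Maxwell time eta_m / mu of the mantle wedge.
    """
    scale = 10.0 ** (1.5 * (mw - mw_ref))          # moment ratio M0 / M0_ref
    tau_k_yr = eta_k / mu / SECONDS_PER_YEAR       # Kelvin time of the wedge
    tau_m_yr = eta_m / mu / SECONDS_PER_YEAR       # Maxwell time of the wedge
    return scale * tau_k_yr, scale * tau_m_yr


if __name__ == "__main__":
    for label, mw in (("Medium", 8.4), ("Large", 8.8), ("Extra-large", 9.2)):
        t_k, t_m = system_relaxation_times(mw)
        print(f"{label:12s} Mw {mw:.1f}: T_K = {t_k:5.2f} yr, T_M = {t_m:5.1f} yr")
```

Under these assumptions each 0.4-unit step in magnitude lengthens both times by a factor of about four, from roughly 0.25 yr and 5 yr at magnitude 8.4 to roughly 4 yr and 80 yr at magnitude 9.2.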

These models show how the surface of the upper plate progressively
reverses its sense of motion from seaward to landward as the effect of
fault locking becomes more dominant. The Extra-large model
($M_w \approx 9.2$), despite its simplicity, is consistent with predictions using
models for the three giant events shown in Fig. 3 and current deformation
in Alaska and northeastern Japan. It indicates that opposing motion
will continue at Chile and Alaska for another few decades and will
prevail at Sumatra and northeast Japan within a decade or two. For
the Large model ($M_w \approx 8.8$), complete reversal from seaward to landward
motion will take a decade or two. We predict this will happen in the
area of the $M_w$ = 8.8 Maule event of 2010, where wholesale landward
motion was observed before the earthquake (northernmost part of
Fig. 3b) but wholesale seaward motion is taking place at present.
Published examples for the quick reversal illustrated by the Medium
model ($M_w \approx 8.4$) include two $M_w \approx 8$-8.1 earthquakes in 1995 in
Antofagasta, northern Chile and Colima-Jalisco, Mexico. Some of
the GPS sites in the region of the $M_w$ = 8.3 Kuril event in 2006 (ref. 46)
have now reversed their motion. At the time of the $M_w$ = 8.4 Arequipa,
Peru, earthquake of 2001 (ref. 40), there was only one continuous GPS
station (AREQ) in the affected area, some 200 km from the trench, and it
reversed its motion about 6 to 7 years after the earthquake. The exact
timing of the reversal depends also on the plate convergence rate, the
amount of afterslip, and structural/geometrical details, and therefore may
differ from these idealized models. The afterslip behaviour may vary from
event to event depending on such factors as the structure and fluid
pressure of the fault zone.

Figure 4 | Evolution of seaward-landward motion transition for different
earthquake sizes. For each model, areas to the left of the curve move landward,
and those to the right move seaward. The inset shows the model geometry and
the line of symmetry (thick line) along which results are shown. Solid and
dashed colour lines are results with and without afterslip, respectively. Within
each trio, the models differ in rupture length, coseismic slip (assumed to scale
with rupture length), and hence earthquake magnitude. The coseismic slip
tapers both downdip and updip. The ensuing interseismic locking and its
downdip transition (dashed lines) extend to the entire strike length, regardless
of the rupture length. Except for the simple geometry and a convergence rate of
5 cm yr$^{-1}$, all parameters are the same as for Fig. 3. The Extra-large model
qualitatively applies to today’s deformation fields of the five subduction zones
as indicated.

Also important, but not illustrated here, are along-strike variations in 
the degree of interseismic locking, as often inferred from geodetic
measurements. Slowly slipping fault segments serve to load neighbouring,
locked patches and thus affect the time and size of future rupture.
Such inhomogeneous locking affects the interseismic deformation pattern,
and more so in the near field.


Looking into the future 
Understanding the physics of interseismic deformation helps us to 
assess seismic and tsunami hazards, because it enables us to infer the 
locking state of the subduction fault. If viscous mantle creep is ignored, 
all surface deformation has to be explained by the locking and slip of 
various parts of the fault, leading to an incorrect depiction of the locking 
state. Fault-locking scenarios based on elastic models, either via forward 
modelling or inversion, should be reassessed to include the effect of 
viscoelastic relaxation. This also applies to other tectonic settings. 
Although our present knowledge of Earth’s viscoelasticity helps us to 
address the first-order physics of SEC, there are critical needs for better 
laboratory and field constraints on the transient rheology and a better 
understanding of the role of nonlinear rheology. The importance of 
better characterization and understanding of afterslip is illustrated in 
Fig. 4 by the large difference in short-term postseismic deformation with 
and without assuming afterslip. Measurements near the rupture zone 
are particularly useful. In this regard, ongoing or planned seafloor 
experiments in the rupture area of the 2011 Tohoku earthquake, such 
as GPS-acoustic surveys, pressure monitoring, and borehole strain
measurements, promise breakthrough discoveries in the next few years.
Future seafloor geodesy can also fill the knowledge gap of SEC deformation
seaward of the trench and yield critical constraints on the rheology
of oceanic mantle. 

The SEC deformation needs to be understood in a broad geodynamic 
context. It seems that the discovery of episodic slow slip events, sometimes
accompanied by low-frequency seismic tremor, will soon lead to
much improved understanding of the mechanics of the subduction fault
and the SEC. Viscoelasticity gives rise to an asymmetry in SEC, that is,
coseismic deformation is confined to the vicinity of the rupture zone, but
interseismic deformation occurs in a broader region. The intriguing
implications of this asymmetry for geodynamics and earthquake and 
tsunami hazards are yet to be explored.


A novel ChREBP isoform in adipose tissue
regulates systemic glucose metabolism

The prevalence of obesity and type 2 diabetes is increasing worldwide and threatens to shorten lifespan. Impaired insulin
action in peripheral tissues is a major pathogenic factor. Insulin stimulates glucose uptake in adipose tissue through the
GLUT4 (also known as SLC2A4) glucose transporter, and alterations in adipose tissue GLUT4 expression or function
regulate systemic insulin sensitivity. Downregulation of human and mouse adipose tissue GLUT4 occurs early in
diabetes development. Here we report that adipose tissue GLUT4 regulates the expression of
carbohydrate-responsive-element-binding protein (ChREBP; also known as MLXIPL), a transcriptional regulator of lipogenic and
glycolytic genes. Furthermore, adipose ChREBP is a major determinant of adipose tissue fatty acid synthesis and
systemic insulin sensitivity. We find a new mechanism for glucose regulation of ChREBP: glucose-mediated
activation of the canonical ChREBP isoform (ChREBP-α) induces expression of a novel, potent isoform (ChREBP-β)
that is transcribed from an alternative promoter. ChREBP-β expression in human adipose tissue predicts insulin
sensitivity, indicating that it may be an effective target for treating diabetes.


Insulin resistance is a common complication of obesity and a major
factor in the pathogenesis of type 2 diabetes and cardiovascular disease.
Adipose tissue contributes to the development of obesity-related insulin
resistance through increased release of fatty acids, altered adipokine
secretion, and/or macrophage infiltration and cytokine release.
Altered adipose tissue glucose metabolism is also an important cause
of insulin resistance, and adipose tissue GLUT4, the major insulin-
responsive glucose transporter, has a central role in systemic glucose
metabolism. In insulin-resistant states, GLUT4 is downregulated in
adipose tissue, but not in muscle, the major site of insulin-stimulated
glucose uptake. In addition, mice with adipose-specific GLUT4 over-
expression (AG4OX) have improved glucose homeostasis, whereas
adipose-specific GLUT4 knockout mice (AG4KO) have insulin resist-
ance and type 2 diabetes. We investigated how altering adipose tissue
glucose flux regulates glucose homeostasis. We show that ChREBP, a
glucose-responsive transcription factor that regulates fatty acid synthesis
and glycolysis, is highly regulated by GLUT4 in adipose tissue and is a
key determinant of systemic insulin sensitivity and glucose homeostasis.
Also, ChREBP in adipose tissue is required for the improved glucose
homeostasis resulting from increased adipose GLUT4 expression.
GLUT4-mediated glucose uptake induces ChREBP, which activates
adipose tissue de novo lipogenesis (DNL). The latter is associated with
enhanced insulin sensitivity. In obese humans, adipose ChREBP gene
expression correlates with insulin sensitivity, suggesting that ChREBP
protects against obesity-associated insulin resistance. In addition, we
discovered a novel mechanism for glucose-regulated ChREBP expres-
sion involving a new isoform, ChREBP-β, which is expressed from an
alternative promoter in a glucose- and ChREBP-dependent manner.
In contrast, expression of the canonical ChREBP-α isoform is not
regulated by glucose flux. However, glucose-induced ChREBP-α
transcriptional activity increases ChREBP-β expression. Furthermore,
expression of ChREBP-β is more highly regulated than ChREBP-α in
adipose tissue in insulin-resistant states. Thus, activation of adipose
ChREBP, and particularly ChREBP-β, may be a novel strategy for
preventing and treating obesity-related metabolic dysfunction and
type 2 diabetes (Supplementary Fig. 1).


Glucose regulates adipose tissue ChREBP 


To understand the mechanisms by which adipocytes respond to
changes in glucose flux, we analysed global gene expression in adipose
tissue from AG4OX and AG4KO mice. Gene-set enrichment analysis
demonstrated coordinate upregulation of DNL enzymes in AG4OX
mice (Supplementary Table 1), which we confirmed (Fig. 1a). DNL
enzymes were downregulated in AG4KO mice (Fig. 1a). Therefore we
investigated the expression of the two major transcription factors
known to regulate DNL enzymes, sterol regulatory-element-binding-
protein 1c (SREBP-1c; also known as isoform 1c of SREBF1) and
ChREBP. ChREBP, but not SREBP-1c, expression is increased 50%
in AG4OX and decreased 44% in AG4KO adipose tissue compared to
controls (Fig. 1b). SREBP-1c transcriptional activity is primarily deter-
mined by the accumulation of mature SREBP-1c in the nucleus.
However, the nuclear abundance of SREBP-1c is not increased in
AG4OX adipose tissue (Supplementary Fig. 2a). Liver X receptors
(LXR-α and LXR-β; also known as NR1H3 and NR1H2, respectively)
can regulate the expression of both ChREBP and SREBP-1c, and
DNL enzymes. Expression of canonical LXR targets does not change
(Supplementary Fig. 2b) in AG4OX or AG4KO adipose tissue, indi-
cating that LXR activity is unaffected and is not driving the changes in
ChREBP or DNL enzyme expression. In contrast, expression of Rgs16
and Txnip, two ChREBP transcriptional targets not known to be
regulated by other lipogenic transcription factors, was reciprocally
regulated in AG4OX and AG4KO mice (Supplementary Fig. 2c). In
AG4KO and control mice (Fig. 1c), and also in 30 different mouse
strains (Supplementary Fig. 3), adipose ChREBP expression strongly
correlates with GLUT4 expression. Expression of ChREBP transcrip-
tional targets FAS (also known as Fasn) and Acc1 (also known as
Acaca) also strongly correlates with ChREBP expression across these
strains (Supplementary Fig. 3). Thus, adipose ChREBP may mediate
the effects of altered GLUT4 expression and glucose flux on lipogenic
enzyme expression.

Figure 1 | Genetically altering adipose tissue glucose flux regulates the
expression of ChREBP and its lipogenic targets. a, b, mRNA expression of
fatty acid synthetic enzymes (a) and lipogenic transcription factors (b) in
perigonadal fat from 6-week-old female mice (n = 10-14 per group). AU,
arbitrary units. WT, wild type. *P < 0.05 compared to respective controls.
c, ChREBP and Glut4 mRNA correlate highly in perigonadal WAT from
control and AG4KO mice (n = 27). Values are means ± standard error.


ChREBP mediates effects of adipose GLUT4 


Increased DNL in adipose tissue is associated with enhanced glucose
tolerance and insulin sensitivity. Consistent with the increases in
ChREBP and DNL enzyme expression (Fig. 1a, b), DNL in vivo is
increased in perigonadal and subcutaneous white adipose tissue
(WAT) in AG4OX mice compared to wild type (Fig. 2a).

To determine whether adipose ChREBP is required for the
increased adipose tissue DNL resulting from GLUT4 overexpression,
we crossbred AG4OX mice with whole-body ChREBP KO mice.
DNL normalizes in adipose tissue after deletion of ChREBP in
AG4OX mice (Fig. 2a). Glucose incorporation into newly synthesized
fatty acids is also decreased in wild-type and AG4OX adipose tissue
after ChREBP knockout (Supplementary Fig. 4). In liver, DNL is
normal in AG4OX and AG4OX/ChREBP KO mice and is not
decreased in ChREBP KO mice (Fig. 2a), in contrast to a prior
report. This difference may be attributable to differences in age,
sex, genetic background and/or diet.

In parallel with a reduction in DNL in AG4OX adipose tissue after
ChREBP deletion, DNL enzyme messenger RNA and protein expres-
sion levels also normalize (Fig. 2b and Supplementary Fig. 5). Thus,
ChREBP is a major regulator of DNL enzymes in adipose tissue and
mediates the changes in DNL enzyme expression resulting from
increased glucose flux.

AG4OX mice are obese (Fig. 2c, d) but have enhanced glucose
tolerance (Fig. 2e). Increased body weight (Fig. 2c), adiposity
(Fig. 2d) and a tendency towards increased food intake normalize with
ChREBP deletion; energy expenditure was normal (Supplementary
Fig. 6). In spite of obesity, AG4OX mice are more insulin sensitive
than wild-type mice (Supplementary Table 2). Decreased fed glycaemia
(Supplementary Table 2), markedly enhanced glucose tolerance
(Fig. 2e), and lower glycaemia during insulin tolerance testing (Fig. 2f)
in AG4OX mice normalized after ChREBP deletion. Genetically ablating
ChREBP in wild-type and AG4OX mice impairs insulin sensitivity
(Supplementary Table 2). ChREBP KO mice show modest glucose
intolerance (Fig. 2e), as reported. Glycaemia in ChREBP KO mice
remains elevated after food removal compared to wild-type mice
(Fig. 2g). ChREBP knockout prevents the hypoglycaemia that develops
in AG4OX mice after food removal (Fig. 2g). ChREBP deletion did not
decrease GLUT4 overexpression or glucose uptake (Supplementary
Fig. 7) in AG4OX adipocytes, indicating that the mechanism by which
ChREBP regulates glucose homeostasis is not by altering adipose tissue
glucose uptake. The decreased insulin sensitivity and glucose tolerance
in AG4OX/ChREBP KO mice are not due to elevated serum triglycerides
or non-esterified fatty acids, as both metabolites decrease compared to
AG4OX mice (Supplementary Table 2). The effects of ChREBP KO on
metabolism and gene expression in AG4OX mice are likely to result
from the absence of ChREBP specifically in adipose tissue, because
ChREBP expression is increased in adipose tissue (Fig. 1b) but not in
liver of AG4OX mice (Supplementary Fig. 8). Reversal of enhanced
glucose tolerance and insulin sensitivity with normalization of DNL is
consistent with a role for adipose tissue DNL in improving systemic
glucose metabolism.

Figure 2 | ChREBP is essential for the effects of adipose tissue GLUT4 on
adiposity, DNL, and glucose homeostasis. a, DNL measured in vivo in fed,
4-month-old male mice. PG, perigonadal; SC, subcutaneous. n = 5-6 per
group. b, Quantification of western blots of FAS and ACC in subcutaneous fat
from fed, 6-month-old females (n = 9-10 per group). a, b, *P < 0.05 versus
same ChREBP genotype, different AG4OX genotype. **P < 0.05 versus same
AG4OX genotype, different ChREBP genotype. c, Body weights in male mice
on chow (n = 5-7 per group). *P < 0.05 versus all other groups at the indicated
time. d, Body composition in 8-week-old, female mice (for d-g; n = 10-12 per
group). *P < 0.01 versus all others. e, f, Glucose tolerance test (e) and insulin
tolerance test (f). *P < 0.05 versus all others; **P < 0.05 versus wild type and
ChREBP KO. g, Glycaemia following food removal. *P < 0.05 versus all others;
**P < 0.05 versus ChREBP KO and AG4OX/ChREBP KO; ***P < 0.05 versus
AG4OX and AG4OX/ChREBP KO. Values are means ± standard error.

334 | NATURE | VOL 484 | 19 APRIL 2012


Obesity regulates adipose tissue ChREBP 


High-fat diet (HFD) causes obesity and insulin resistance and down-
regulates GLUT4 expression selectively in adipose tissue. To invest-
igate whether adipose ChREBP expression has a role in the insulin
resistance resulting from adipose GLUT4 downregulation, we sub-
jected mice to HFD. On chow, AG4OX mice are obese (Figs 2c, d and
3a). AG4OX mice do not gain more weight on HFD and their degree
of obesity on both diets is similar to that of HFD-fed wild-type mice
(Fig. 3a and Supplementary Fig. 9). HFD induces insulin resistance in
both genotypes (Supplementary Table 3). HFD causes a diabetic glucose
tolerance test in wild-type mice (Fig. 3b) but not in AG4OX mice. The
enhanced glucose tolerance in AG4OX mice does not result from
changes in serum metabolites typically associated with insulin
sensitivity. AG4OX mice have higher non-esterified fatty acids and
triglycerides, and lower leptin levels on both diets (Supplementary
Table 3), and lower adiponectin levels on chow compared to wild
type. This pattern is thought to contribute to insulin resistance
and not to enhanced insulin sensitivity.

HFD reduces expression of adipose tissue DNL enzymes, which
correlates with insulin resistance. Therefore, we examined whether
downregulation of adipose ChREBP and DNL contributes to HFD-
induced insulin resistance. HFD in wild-type mice reduces DNL in
subcutaneous but not perigonadal adipose tissue (Fig. 3c). In
AG4OX mice, HFD markedly reduces DNL in WAT, but DNL remains
2-7-fold higher than in wild-type mice on HFD. This intermediate
rate of DNL parallels the change in glucose tolerance in AG4OX mice
on HFD (Fig. 3b). Changes in glucose incorporation into newly
synthesized fatty acids (Supplementary Fig. 10) are similar to the
changes in total (from all substrates) DNL (Fig. 3c) in WAT. In liver,
DNL is not higher in AG4OX on either diet (Fig. 3c). HFD inhibits
adipose DNL independently of adipose tissue glucose uptake in
AG4OX mice, as GLUT4 protein levels (Supplementary Fig. 11)
and glucose transport remain elevated.

To investigate the mechanisms underlying HFD-induced changes in 
adipose DNL, we examined the expression of lipogenic transcription 
factors and DNL enzymes. In wild-type mice, HFD diminishes ChREBP
but not SREBP-1c expression in WAT (Fig. 3d), as reported. In
AG4OX WAT, HFD markedly decreases ChREBP and also SREBP-1c
expression but ChREBP remains elevated compared to wild-type mice 
on HFD. Nevertheless, HFD reduces the degree of ChREBP induction in 
AG4OX adipose tissue despite persistent GLUT4 overexpression 
(Supplementary Fig. 11). Thus, HFD may regulate ChREBP expression 
in part independently of GLUT4 expression and glucose flux, consistent 
with HFD effects on DNL (Fig. 3c). 

Many DNL enzymes that are upregulated in WAT of AG4OX on
chow are downregulated in HFD-fed wild-type and AG4OX mice
(Fig. 3e and Supplementary Fig. 12) but remain higher in AG4OX
WAT compared to wild type in a pattern paralleling ChREBP expres-
sion (Fig. 3d) and DNL (Fig. 3c). In wild-type WAT, ChREBP down-
regulation probably accounts for HFD-induced downregulation of
DNL gene expression, as WAT SREBP-1c expression is not altered.
Furthermore, whole-body genetic ablation of SREBP-1c does not
diminish expression of these genes in WAT.

Figure 3 | ChREBP is regulated in mouse and human adipose tissue in
pathological conditions. a, Body weight in chow- versus HFD-fed wild-type
and AG4OX mice (9-week-old, n = 8 per group). *P < 0.05 chow-fed wild type
compared to all others at the same time point. b, Glucose tolerance test in
7-week-old males (n = 8 per group). *P < 0.001 versus all others. c, Fatty acid
synthesis measured in vivo in 5-6-month-old males (n = 7 per group).
PG, perigonadal; SC, subcutaneous. Right, plasma insulin: HFD increases
insulin (***P = 0.01 by two-way ANOVA). d, e, mRNA expression of ChREBP
and SREBP-1c (d) and Acc1 and Fas (e) in subcutaneous fat from 4-month-old
male mice (n = 8-14 per group). For panels c-e, *P < 0.05 compared to same
diet, different genotype. **P < 0.05 compared to same genotype, different diet.
f, Subcutaneous fat ChREBP mRNA correlates with insulin sensitivity in non-
diabetic, normal glucose-tolerant humans (n = 123). g, ChREBP and Glut4
mRNA expression in subcutaneous fat correlate in this group. h, Subcutaneous
fat ChREBP mRNA correlates highly with insulin sensitivity in obese, non-
diabetic, BMI-matched humans (n = 38). i, ChREBP and Glut4 mRNA
expression do not correlate in this group. Values are means ± standard error.

In contrast, both ChREBP and SREBP-1c expression are down-
regulated in WAT in AG4OX mice on HFD compared to chow
(Fig. 3d). Persistent modest elevation of ChREBP in WAT of 
AG4OX on HFD compared to wild type on HFD (Fig. 3d) is probably 
responsible for the increased DNL that contributes to improved 
glucose tolerance in AG4OX mice (Fig. 3b). Unlike in wild-type 
WAT, in liver, HFD does not downregulate ChREBP and DNL 
enzyme expression (Supplementary Fig. 13). 

LXRs are unlikely to contribute to downregulation of ChREBP
expression, DNL, or DNL enzyme expression in wild-type mice on
HFD, because expression of LXRs and canonical LXR targets is
either unchanged or modestly increased (Supplementary Fig. 14).
MAX-like protein X (MLX) is an obligate dimerization partner for
ChREBP transcriptional activity. Neither HFD nor GLUT4 over-
expression alters MLX expression in WAT (Supplementary Fig. 14).

We next sought to determine whether adipose ChREBP might con- 
tribute to regulating insulin sensitivity and glucose homeostasis in 
humans. In 123 non-diabetic individuals with normal glucose tolerance 
and widely ranging body mass index (BMI) values (Supplementary 
Table 4), ChREBP expression in subcutaneous WAT correlates strongly 
with insulin sensitivity measured during a euglycaemic and hyper- 
insulinaemic clamp procedure (Fig. 3f). Adipose tissue ChREBP cor- 
relates with GLUT4 (Fig. 3g), consistent with a role for ChREBP in the 
beneficial effects of adipose tissue GLUT4 expression on glucose 
homeostasis. 

Most, but not all, obese people are insulin resistant. To determine
whether adipose ChREBP expression could have a role in regulating
insulin sensitivity in obese people, we also investigated adipose
ChREBP expression in non-diabetic obese individuals with widely
ranging insulin sensitivity (Supplementary Table 4). Adipose 
ChREBP expression was directly associated with insulin-stimulated 
glucose uptake during a clamp, independent of BMI (Fig. 3h). 
Surprisingly, ChREBP and GLUT4 expression did not correlate in 
these obese individuals (Fig. 3i). Thus, adipose ChREBP expression 
may have beneficial effects on insulin sensitivity, even among obese 
subjects, and this can be independent of GLUT4 expression. 


Identification of a novel ChREBP isoform 


The mechanism by which glucose activates ChREBP transcriptional
activity is complex and controversial. It has been attributed to xylulose-
5-phosphate-induced dephosphorylation of specific ChREBP residues.
Alternatively, glucose or a metabolite may prevent the interaction of an
amino-terminal inhibitory domain with a glucose-activating domain
within the ChREBP protein. Glucose and other carbohydrates can
also induce ChREBP expression, but whether this contributes to
ChREBP activity is unknown. The mechanism for carbohydrate-
mediated ChREBP induction may involve ‘feed-forward’ auto-
regulation, because a dominant-negative MLX mutant that inhibits
ChREBP activity reduces endogenous ChREBP expression.

To investigate the molecular mechanisms by which ChREBP might
transactivate its own expression, we searched the genomic sequence
around the ChREBP transcriptional start site for a carbohydrate res-
ponse element (ChoRE), which is defined by two E-boxes (CACGTG)
separated by five nucleotides. We identified a single candidate ChoRE
17 kb upstream of the mouse ChREBP transcriptional start site
(Supplementary Figs 15 and 16). Additionally, a separate E-box was
identified 255 base pairs proximal to the putative ChoRE (Supplemen-
tary Fig. 16). In this region, the genomic sequence including the ChoRE
and 5′ E-box is highly conserved (Supplementary Figs 16 and 17).
H3K4me3 and H3K4me1 histone methylation marks aligned with this
conserved genomic region (Supplementary Fig. 18), suggesting the
presence of an alternative promoter and potentially an alternative
first exon. By 5′ rapid amplification of cDNA ends, we confirmed an
alternative first exon, exon 1b (Fig. 4a and Supplementary Figs 16-18),
and cloned an mRNA species transcribed from this alternative
promoter in which exon 1b is spliced to exon 2, bypassing exon 1a
and retaining the remainder of exons present in the canonical ChREBP
mRNA species (Fig. 4a).

Figure 4 | Expression of the novel ChREBP-β isoform is regulated in a
glucose- and ChREBP-dependent manner. a, Model of ChREBP-α and
ChREBP-β gene structure with indication of splice sites and translational start
sites (ATG). b, Regulation of ChREBP-α and ChREBP-β mRNA expression in
perigonadal (PG) fat and liver of 10-week-old, female mice with fasting and re-
feeding (n = 6 per group). *P < 0.05 compared to fed group. c, ChREBP-α and
ChREBP-β mRNA expression in perigonadal fat from 6-week-old female
AG4OX and AG4KO mice compared to littermate controls (n = 10-14 per
group). *P < 0.05 versus respective control. d, Glucose regulation of exon 1b
promoter-luciferase reporter and indicated ChREBP-β mutants, co-transfected
with ChREBP-α and Mlx (n = 3 per group). *P < 0.05 compared to non-
mutated exon 1b-luciferase construct in the same glucose. **P < 0.05
compared to the same construct in low glucose. e, ChREBP-α and ChREBP-β
induce an ACC1 ChoRE-luciferase reporter compared to pGL3_basic control
vector in both low- and high-glucose conditions. *P < 0.05 compared to
ChREBP-α in the same glucose levels, **P < 0.05 compared to ChREBP-α in
low-glucose conditions. Values are means ± standard error.
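
The ChoRE definition used in this search, two E-boxes (CACGTG) separated by five nucleotides, translates directly into a pattern match. The Python sketch below is illustrative only and is not the procedure used in the study: the function name and the toy sequence are hypothetical, and it assumes a strict match to the consensus E-box, whereas a real genomic scan might tolerate mismatches.

```python
import re

# Strict ChoRE pattern as defined in the text: two E-boxes (CACGTG)
# separated by exactly five nucleotides. The lookahead allows
# overlapping candidates to be reported.
CHORE = re.compile(r"(?=(CACGTG[ACGT]{5}CACGTG))")


def find_chore_candidates(sequence):
    """Return (0-based start, matched sequence) for each candidate ChoRE."""
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in CHORE.finditer(seq)]


# Hypothetical toy sequence, not a real genomic region.
toy = "ttgaCACGTGaatccCACGTGgga"
for start, motif in find_chore_candidates(toy):
    print(start, motif)
```

Because CACGTG is its own reverse complement, a site on the opposite strand also matches this pattern on the forward strand, so a single pass covers both strands.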


Physiological regulation of ChREBP isoforms 


To determine whether the two distinct ChREBP mRNA species are
physiologically regulated, we performed quantitative polymerase chain
reaction (qPCR) with primers specific for exon 1a (ChREBP-α) or exon
1b (ChREBP-β). In WAT, ChREBP-α expression declines modestly
with overnight fasting, and increases 64% after 3 h of re-feeding a chow
diet compared to mice fed ad libitum (Fig. 4b). In contrast, adipose
ChREBP-β declines 45% after an overnight fast and increases 4.3-fold
with re-feeding compared to ad libitum feeding. Thus, ChREBP-β
expression is more highly regulated in WAT than ChREBP-α with
fasting and re-feeding. In liver, ChREBP-α expression declines 40%
and ChREBP-β expression tends to decline (25%) with overnight fast-
ing. After 3 h of re-feeding, ChREBP-α remains suppressed whereas
ChREBP-β returns to baseline. After 6 h of re-feeding, liver ChREBP-α
levels return to normal (data not shown). Thus, in liver, the time courses
of ChREBP-α and ChREBP-β regulation differ.

To determine whether alterations in glucose flux selectively regu-
late ChREBP-α or ChREBP-β, we examined the expression of the two
isoforms in AG4OX and AG4KO WAT. Total ChREBP expression
increases 50% in AG4OX and decreases 50% in AG4KO WAT
(Fig. 4d). Adipose ChREBP-α expression remains unchanged in
AG4OX and declines modestly in AG4KO. In contrast, in WAT,
ChREBP-β increases 4.6-fold in AG4OX mice, and decreases 97%
in AG4KO (Fig. 4c). Considering the difference in magnitude of
change of total ChREBP compared to ChREBP-β, ChREBP-β is probably
much less abundant than ChREBP-α. Although expression of both
ChREBP-α and ChREBP-β is nutritionally regulated (Fig. 4b),
ChREBP-β alone responds robustly to GLUT4-mediated changes in
glucose flux (Fig. 4c).


Glucose-activated ChREBP-α induces ChREBP-β


We investigated the molecular mechanism by which ChREBP-β
expression is regulated in response to changes in adipose GLUT4
expression. Expression of a ChREBP-β-promoter-luciferase con-
struct is markedly increased in a glucose-dependent manner with
co-transfection of ChREBP-α and MLX (Fig. 4d). MLX, the
dimerization partner for ChREBP, is required for ChREBP transcrip-
tional activity. Co-transfection of both ChREBP-α and MLX was
required for induction of ChREBP-β-promoter-luciferase activity in
high glucose conditions (Supplementary Fig. 19), indicating that
ChREBP-β requires transactivation by ChREBP-α for expression.
Basal and glucose-stimulated expression is attenuated with deletion
of either the ChoRE or the upstream E-box and abolished entirely
with deletion of both (Fig. 4d). Glucose and ChREBP-α/MLX had no
effect on the expression of a luciferase reporter containing 5 kb of
the ChREBP-α promoter, indicating that ChREBP-α does not regulate
its own expression (data not shown). Thus, ChREBP-α specifically
regulates expression of ChREBP-β.

The translational start site for ChREBP-α resides in exon 1a and
translation from this site produces an 864-amino-acid protein. No
translational start site is present in exon 1b and translation beginning
at the next start site, in exon 4, produces a 687-amino-acid protein in
which two nuclear export signals, a nuclear localization signal, and a
domain that inhibits ChREBP transcriptional activity in low-glucose
conditions are deleted (Supplementary Fig. 20). N-terminally
deleted ChREBP mutants demonstrate increased nuclear localization
and enhanced transcriptional activity in both low- and high-glucose
conditions. Using an ACC1-ChoRE-luciferase reporter construct,
we compared the transcriptional activity of ChREBP-α and ChREBP-β
(Fig. 4e). High glucose increased ChREBP-α transcriptional activity
2.4-fold over low glucose. ChREBP-β activity in either low or high
glucose is increased ~20-fold compared to ChREBP-α activity in high
glucose, and shows no glucose regulation. The elevated ChREBP-β
transcriptional activity may result in part from absence of the
N-terminal inhibitory domain or its constitutive nuclear localization
in low- and high-glucose conditions (Supplementary Fig. 21). These
results suggest a feed-forward mechanism in which glucose-stimulated
ChREBP-α transactivates expression of ChREBP-β (Fig. 4d and
Supplementary Fig. 21), a more potent activator of ChREBP transcrip-
tional targets (Fig. 4e).


Pathophysiological regulation of ChREBP-α/β


In mice fed a HFD, ChREBP-β expression declines in WAT whereas
ChREBP-α expression remains unchanged (Fig. 5a), consistent with
the possibility that downregulation of adipose ChREBP-β contributes
to insulin resistance.

In human WAT, expression of both ChREBP-α and ChREBP-β
correlates with insulin sensitivity (Fig. 5b). Multiple linear regression
analysis demonstrates that expression of ChREBP-β (β = 0.454,
P = 0.016) and not ChREBP-α (β = 0.136, P = 0.356) predicts insulin
sensitivity (Supplementary Table 5).
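
For readers unfamiliar with this kind of comparison, the sketch below shows how two correlated predictors can be separated in a two-predictor linear model using standardized coefficients. It is not the authors' analysis: the data are synthetic, the function names are hypothetical, the actual model (Supplementary Table 5) may include further covariates, and treating the reported β values as standardized coefficients is an assumption.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def standardized_betas(y, x1, x2):
    """Standardized coefficients of y ~ x1 + x2, from pairwise correlations."""
    r_y1, r_y2, r_12 = pearson(y, x1), pearson(y, x2), pearson(x1, x2)
    denom = 1.0 - r_12 ** 2
    beta1 = (r_y1 - r_y2 * r_12) / denom
    beta2 = (r_y2 - r_y1 * r_12) / denom
    return beta1, beta2


# Synthetic data: x2 tracks x1 closely, but y depends on x1 only.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [1.1, 1.9, 3.2, 3.8, 5.1, 6.2]
y = [2.0 * a + 3.0 for a in x1]

print(pearson(y, x1), pearson(y, x2))  # both predictors correlate with y
print(standardized_betas(y, x1, x2))   # only x1 keeps a non-zero coefficient
```

In this toy case both predictors correlate with the outcome when considered alone, yet only one retains a coefficient once both enter the model, which is the pattern the regression result above describes.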


Implications 

Adipocytes are key regulators of whole-body energy homeostasis and
metabolic function. Our prior work demonstrating that adipose
GLUT4 expression regulates systemic insulin sensitivity indicated that
adipocytes are capable of sensing and coordinating responses to changes
in glucose availability. The elucidation here of the molecular mechanism
by which glucose-mediated ChREBP activation transactivates expres-
sion of a novel, potent ChREBP-β isoform (Supplementary Fig. 22)
reveals the complexity underlying cellular responses to nutrient
availability. Our data demonstrate that adipose tissue ChREBP has a
key role in integrating adipocyte and whole-body metabolic function
and this may be mediated by transcriptional regulation of the potent
ChREBP-β isoform.

The beneficial effects of adipose tissue ChREBP on glucose
homeostasis may result from upregulation of adipose tissue DNL
and/or other ChREBP-dependent actions such as changing the
metabolic fate of glucose after it enters adipocytes and/or regulating
a novel adipokine. The strong, direct correlation between adipose
ChREBP expression and insulin sensitivity in humans suggests that
adipose ChREBP may be involved in regulating whole-body insulin
action. The beneficial metabolic effects of increased DNL in adipose
tissue contrast with the adverse effects in the liver. DNL in liver
increases intrahepatic triglyceride content and may contribute to
insulin resistance and the metabolic syndrome. In summary, our
data support the importance of adipose ChREBP in regulating insulin
action and glucose homeostasis in human obesity and diabetes. These
data suggest that selective activation of adipose tissue ChREBP, and
specifically ChREBP-β, could be an effective therapeutic strategy for
preventing and treating type 2 diabetes and obesity-related metabolic
diseases.

Figure 5 | Adipose tissue ChREBP-β expression predicts insulin sensitivity.
a, ChREBP-α and ChREBP-β mRNA expression in subcutaneous (SC) fat of
4-month-old male mice on chow or HFD (n = 10-14 per group). *P < 0.05
compared to chow-fed. b, mRNA expression of ChREBP-β in subcutaneous fat
correlates more highly with insulin sensitivity (percentage increase in insulin-
stimulated glucose uptake over basal measured during a euglycaemic and
hyperinsulinaemic clamp procedure) than ChREBP-α in obese, non-diabetic,
BMI-matched humans (n = 38). Values are means ± standard error.


METHODS SUMMARY 


AG4OX, AG4KO and ChREBP KO mice were described previously, except that
AG4KO mice were generated using adiponectin-Cre-expressing mice rather
than aP2-Cre-expressing mice for all studies except for DNA microarrays.
Phenotypic analyses were performed as described. All mouse studies were
conducted in accordance with federal guidelines and were approved by the 
BIDMC Institutional Animal Care and Use Committee. For microarray analysis, 
RNA was isolated from perigonadal adipose tissue and the cDNA was analysed on 
the Affymetrix MG-U74-A.v2 Genechip. Quantitative real-time PCR was per- 
formed on the Applied Biosystems 7900 HT using SYBR Green PCR Master Mix. 
Cloning, mutagenesis, and luciferase assays were performed by standard methods. 
Human subjects were recruited as described. Human studies were approved by
the Human Studies Committee of Washington University School of Medicine or 
the Ethics Committee of the University of Leipzig. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature.


METHODS

Animal studies. Generation and initial metabolic characterization of the
AG4OX, AG4KO and ChREBP KO mice were previously described, except that
AG4KO mice were generated using adiponectin-Cre-expressing mice rather
than aP2-Cre-expressing mice for all studies except for the DNA microarrays.
Wild-type and adiponectin-Cre littermates were used as controls for AG4KO
mice. Mice were housed at Beth Israel Deaconess Medical Center with a
14 h/10 h light-dark cycle and were fed standard chow (Formulab 5008) or
HFD (Harlan-Teklad TD.93075) ad libitum. All studies were performed on
age- and sex-matched littermates. Phenotypic analyses were performed as described.
Blood collections were performed by submandibular vein or tail vein bleeding.
Body composition was measured by NMR (Echo Medical Systems). Glucose
tolerance tests were performed by injection of 1 mg glucose per kg body weight
intraperitoneally after 5 h food removal. For the fasting glycaemic time course
experiment, food was removed at 8:00. Mice were killed by CO2 euthanasia or
decapitation; serum was collected, and tissues were harvested, snap frozen in
liquid nitrogen, and stored at −80 °C for processing. Mouse studies were con-
ducted in accordance with federal guidelines and were approved by the BIDMC
Institutional Animal Care and Use Committee.

Human studies. For the cross-sectional cohort, adipose tissue samples were 
obtained from 123 men (n = 64) and women (n = 59) who underwent open 
abdominal surgery for gastric banding, cholecystectomy, appendectomy, weight 
reduction surgery, abdominal injuries or explorative laparotomy. These patients 
represent a normal glucose tolerant subset of patients previously described. All
subjects had a stable weight, defined as the absence of fluctuations of >2% of body 
weight for at least 3 months before surgery. Patients with malignant diseases or 
any acute or chronic inflammatory disease, as determined by a leukocyte count of
>7,000 Gpt l⁻¹, C-reactive protein levels of >50 mg l⁻¹ or clinical signs of infec-
tion, were excluded from the study. After resection, samples of visceral and
subcutaneous adipose tissue were immediately frozen in liquid nitrogen. 
Euglycaemic and hyperinsulinaemic clamp studies were performed before 
surgery in subjects undergoing bariatric surgery or 3 months after surgery in 
subjects who underwent non-elective surgeries. The clamp study was performed 
with an insulin infusion rate of 20 mU kg⁻¹ min⁻¹, as described, and the glucose
disposal rate was defined as the glucose infusion rate during the last 30 min of the
study. The study was approved by the ethics committee of the University of
Leipzig. All participants gave written informed consent before taking part in the 
study. 

For the obese cohort, 38 obese subjects (10 men, 28 women; 41 ± 11 years old)
who had normal oral glucose tolerance were studied. Subjects were sedentary
(that is, participated in regular exercise <1 h per week and ≤1 time per week) and
weight stable. All subjects provided written, informed consent before particip-
ating in the study, which was approved by the Human Studies Committee of
Washington University School of Medicine in St Louis.

Subjects were admitted to the Clinical Research Unit at Washington University
School of Medicine in the evening before the clamp procedure. The next morning,
after subjects fasted for 12 h overnight, a hyperinsulinaemic and euglycaemic
clamp procedure in conjunction with [6,6-²H₂]glucose tracer infusion was per-
formed, as described previously. Euglycaemia was maintained at a blood glucose
concentration of approximately 5.6 mmol l⁻¹ (100 mg dl⁻¹) by infusing 20%
dextrose enriched to 2.5% with [6,6-²H₂]glucose. Skeletal muscle insulin
sensitivity was determined as the relative increase in the rate of glucose uptake
during insulin infusion (50 mU m⁻² min⁻¹).

Microarray analysis. Total RNA from epididymal adipose tissue was extracted 
using the RNeasy Mini Kit from Qiagen from three mice from each of four 
genotypes: aP2-Cre transgenic littermates (controls for AG4KO mice), AG4KO 
mice; FVB littermates (controls for AG4OX) and AG4OX. RNA from each mouse 
was hybridized on an Affymetrix MG-U74-A.v2 Genechip microarray. 
Affymetrix gene chip hybridization and analysis were performed at the 
Genomics Core Facility of the Beth Israel Deaconess Medical Center. Genome-wide
expression analysis of the microarray data was performed using gene set
enrichment analysis (GSEA) and the s2.mgu74av2.gmt gene set database
available through the Broad Institute. 

Gene expression analysis. Mouse tissues were harvested following CO₂
euthanasia or decapitation, snap frozen in liquid nitrogen, and stored at
−80 °C for processing. Total RNA was extracted from frozen tissue with TRI
Reagent (Molecular Research Center). Reverse transcription was performed using 
the Advantage RT-for-PCR kit (Clontech). Quantitative PCR was performed 
using SYBR Green PCR Master Mix (Applied Biosystems) in a 7900 HT 
thermocycler (Applied Biosystems). SDS 2.3 (Applied Biosystems) was used for 
calculation of cycle thresholds. Relative expression levels were determined using 
the standard curve method with normalization of target gene expression levels to 
18s ribosomal RNA (18s) or TATA box binding protein (Tbp). See Supplementary
Table 6 for primer sequences.

For the cross-sectional cohort, total RNA was isolated from subcutaneous 
adipose tissue samples using TRIzol (Life Technologies) and cDNA was synthesized 
with standard reagents (Life Technologies). Quantitative PCR was performed 
using Brilliant SYBR Green QPCR Core Reagent Kit (Stratagene) in a PRISM 
7900 thermocycler (Applied Biosystems). Relative expression levels were determined 
using the standard curve method with normalization of target gene expression levels 
to 18s. 
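
The standard curve method named above can be sketched as follows. This is a minimal illustration rather than the authors' analysis: the dilution series, the Ct values and all function names are hypothetical, and a plain least-squares line of Ct against log10(input quantity) is assumed.

```python
import math


def fit_standard_curve(quantities, cts):
    """Least-squares fit of Ct = slope * log10(quantity) + intercept."""
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def quantity_from_ct(ct, slope, intercept):
    """Invert the standard curve to get a relative quantity from a Ct."""
    return 10 ** ((ct - intercept) / slope)


def normalized_expression(target_ct, target_curve, reference_ct, reference_curve):
    """Target quantity divided by reference-gene quantity (for example 18s)."""
    target = quantity_from_ct(target_ct, *target_curve)
    reference = quantity_from_ct(reference_ct, *reference_curve)
    return target / reference


# Hypothetical ten-fold dilution series (relative input) and measured Ct values.
dilutions = [1.0, 0.1, 0.01]
target_curve = fit_standard_curve(dilutions, [20.0, 23.0, 26.0])
reference_curve = fit_standard_curve(dilutions, [12.0, 15.0, 18.0])
ratio = normalized_expression(23.0, target_curve, 15.0, reference_curve)
print(ratio)
```

Each gene gets its own curve, so differences in amplification efficiency between the target and the reference are absorbed by the fitted slopes.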

For the obese cohort, total RNA was isolated from subcutaneous adipose tissue 
samples using TRIzol (Invitrogen) and cDNA was synthesized using Taqman 
Reverse Transcription (Applied Biosystems). Quantitative PCR was performed 
using SYBR Green PCR Master Mix (Applied Biosystems) on the ABI 7500 Real-Time
PCR System (Applied Biosystems). Relative expression levels were determined
using the 2^(−ΔΔCt) method with normalization of target gene expression
levels to ribosomal protein, large, P0 (36B4) gene.
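
As an illustration of the 2^(−ΔΔCt) calculation, a minimal sketch is given below. The function name and the Ct values are hypothetical, and the sketch assumes roughly 100% amplification efficiency for both target and reference genes, which is the usual assumption behind this method.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Fold change of a sample relative to a calibrator by 2^(-ddCt).

    ct_target, ct_reference: Ct of target and reference gene in the sample.
    ct_target_cal, ct_reference_cal: the same Ct values in the calibrator.
    """
    delta_ct_sample = ct_target - ct_reference
    delta_ct_calibrator = ct_target_cal - ct_reference_cal
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: the target amplifies two cycles earlier in the
# sample than in the calibrator, with an unchanged reference gene.
fold = relative_expression(24.0, 18.0, 26.0, 18.0)
print(fold)
```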

Western blotting. For adipose tissue lysates, aliquots of frozen adipose tissue 
were homogenized on ice in RIPA buffer supplemented with phospho-preserving 
and anti-protease agents (sodium fluoride, sodium pyrophosphate, sodium 
orthovanadate, PMSF, aprotinin and leupeptin). For adipose tissue nuclear 
lysates, an adipose tissue sample was ground under liquid nitrogen, then dounce 
homogenized in low salt buffer (potassium chloride 10 mM; HEPES pH 7.9, 
10 mM; EDTA 0.1 mM; EGTA 0.1 mM; DTT 1 mM) with protease inhibitors.
Homogenates were centrifuged at 1,000g for 10 min at 4 °C and nuclear pellets
were collected and washed with low salt buffer. The nuclear pellets were resus- 
pended in RIPA buffer and vortexed vigorously. Nuclear lysates from 3-4 mice 
from the same genotype were pooled for further processing. Protein concentra- 
tions were assayed using the BCA assay. Equal amounts of protein were loaded 
and transferred to nitrocellulose membranes. Membranes were probed with 
antibodies against acetyl-CoA carboxylase (Cell Signaling), fatty acid synthase 
(Cell Signaling), GLUT4 (provided by H. Haspel), SREBP1 (Novus Biologicals), 
lamin B (Santa Cruz) and PI3K p85 (Upstate). Film images were scanned
(Epson Expression 10000 XL) and results were quantified with ImageQuant TL 
software (GE). 

In vivo fatty acid synthesis. Conscious, freely moving mice in the fed state were 
injected intraperitoneally with 5 mCi of ³H₂O (MP Biomedicals) and 10 μCi of
[U-¹⁴C]-glucose (Perkin Elmer) and killed 1 h later using an overdose of ketamine
and xylazine (160 and 24 mg kg⁻¹ intraperitoneally, respectively). Samples of
plasma were obtained serially at 5, 10, 30 and 60 min after injection for measurement
of the plasma glucose levels and the specific activity of plasma water and
glucose. At sacrifice, tissues were rapidly removed, frozen in liquid N₂ and stored
at −80 °C for processing. Lipids were extracted by the Folch method with
chloroform/methanol (2:1). Fatty acids were obtained by saponification and
extraction with petroleum ether. Incorporation of ³H and ¹⁴C into saponified
fatty acids was measured. The rate of synthesis was calculated as a molar rate
based on the estimate that 13.3 mol of H₂O are incorporated into each newly
synthesized C16 fatty acid. The rate of glucose incorporation into newly
synthesized fatty acids was calculated as nanomoles of ¹⁴C-glucose incorporated
into fatty acids per gram tissue per minute, using ¹⁴C counts in saponified fatty
acids extracted from tissue normalized by the time-averaged specific activity of 
glucose in plasma over the course of the hour-long experiment. 
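
The two rate calculations described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: function names, units (d.p.m. per nmol) and example numbers are hypothetical, and the time-averaged specific activity is assumed here to be a trapezoidal average over the sampled interval, which the text does not specify.

```python
WATER_PER_C16 = 13.3  # mol H2O incorporated per mol new C16 fatty acid (from the text)


def fatty_acid_synthesis_rate(fa_dpm_3h, water_sa_dpm_per_nmol, tissue_g, minutes):
    """nmol of newly synthesized C16 fatty acid per gram tissue per minute."""
    nmol_water = fa_dpm_3h / water_sa_dpm_per_nmol
    nmol_fatty_acid = nmol_water / WATER_PER_C16
    return nmol_fatty_acid / tissue_g / minutes


def time_averaged(times_min, values):
    """Trapezoidal time average over the sampled interval (an assumption)."""
    area = 0.0
    for i in range(1, len(times_min)):
        width = times_min[i] - times_min[i - 1]
        area += 0.5 * (values[i] + values[i - 1]) * width
    return area / (times_min[-1] - times_min[0])


def glucose_incorporation_rate(fa_dpm_14c, times_min, glucose_sa_dpm_per_nmol,
                               tissue_g, minutes):
    """nmol of glucose incorporated into fatty acids per gram tissue per minute."""
    mean_sa = time_averaged(times_min, glucose_sa_dpm_per_nmol)
    return fa_dpm_14c / mean_sa / tissue_g / minutes


# Hypothetical measurements for a 0.5 g tissue sample and a 60 min labelling period.
synthesis = fatty_acid_synthesis_rate(13300.0, 10.0, 0.5, 60.0)
incorporation = glucose_incorporation_rate(
    900.0, [5, 10, 30, 60], [3.0, 3.0, 3.0, 3.0], 0.5, 60.0)
print(synthesis, incorporation)
```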

Glucose uptake in isolated adipocytes. Adipocytes were isolated from 
perigonadal fat pads and glucose uptake was measured as previously described.
Briefly, perigonadal fat pads were digested with collagenase (1 mg ml⁻¹) and cells
were incubated at 37 °C with constant shaking in Krebs-Ringer-HEPES (30 mM)
buffer (pH 7.4) with 2% bovine serum albumin, 200 nM adenosine, and without
(basal) or with (insulin-stimulated) 80 nM insulin. After a 30-min incubation
with or without insulin, U-¹⁴C glucose (3 μM) was added for 60 min and the
reaction was terminated by separating cells from media by spinning the suspension
through dinonyl-phthalate oil. A portion of isolated adipocytes from each
sample was fixed with osmic acid and counted in a Coulter counter to normalize
glucose uptake per cell. 

Identification of ChoRE and ChREBP exon Ib. A position-weighted matrix 
representative of a consensus ChoRE was generated from sequences for 16 mouse, 
human, or rat ChoREs previously identified and experimentally validated 
(Supplementary Fig. 15). Genomic sequence 20 kb upstream and downstream 
of the mouse ChREBP transcriptional start site was obtained from the UCSC 
Genome Browser (Mouse July 2007 (NCBI37/mm9) Assembly) and was
scanned using the Transcriptional Element Search System
(http://www.cbil.upenn.edu/tess) and the position-weighted matrix described above. Genomic sequence
in the exon 1b region was obtained through the UCSC genome browser and 
comparisons across species were performed with Megalign (Dnastar) using the 
Clustal W algorithm. Analysis of histone methylation marks was performed using
the Encode browser and database. ChREBP exon 1b was cloned from RNA
prepared from AG4OX adipose tissue by 5’ rapid amplification of cDNA ends 
(GeneRacer, Invitrogen). A full-length ChREBP-β mRNA species was cloned
using GeneRacer reagents. All cloned products were verified by sequencing. 
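
The idea of scanning genomic sequence with a position-weighted matrix can be sketched as below. This is a toy illustration only: the four six-base sites are hypothetical E-box-like strings, not the 16 validated ChoREs; the pseudocount, uniform background and score threshold are assumptions; and the code does not reproduce the scoring used by the Transcriptional Element Search System.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Log-odds position weight matrix from equal-length aligned sites."""
    length = len(sites[0])
    pwm = []
    for pos in range(length):
        counts = {base: pseudocount for base in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append({base: math.log2((counts[base] / total) / background)
                    for base in BASES})
    return pwm


def scan(sequence, pwm, threshold):
    """Return (start, window, score) for every window scoring >= threshold."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(pwm[i][base] for i, base in enumerate(window))
        if score >= threshold:
            hits.append((start, window, score))
    return hits


# Hypothetical aligned sites and a short hypothetical sequence (A, C, G, T only).
toy_sites = ["CACGTG", "CACGTG", "CACGCG", "CATGTG"]
pwm = build_pwm(toy_sites)
hits = scan("TTTCACGTGAAA", pwm, threshold=5.0)
print(hits)
```

A real scan would also cover the reverse strand and handle ambiguous bases such as N, which this sketch omits.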
Functional analysis of ChoREs. PCR of a bacterial artificial chromosome containing
the mouse ChREBP gene (CH29-535J6, Children’s Hospital Oakland Research
Institute) was used to generate the mouse ChREBP-β promoter sequence (−631 to
+224, in reference to exon 1b transcriptional start site), which was cloned into the
pGL3_basic reporter vector. The putative ChoRE sequence (+157 to +174) and
E-box (−98 to −92) were deleted by site-directed mutagenesis (QuickChange II
Site-Directed Mutagenesis Kit, Agilent). The exon 1b promoter_pGL3, mutant
pGL3 constructs, or an empty pGL3_basic control were co-transfected using
Lipofectamine LTX (Invitrogen) in HEK293T cells with Flag-tagged ChREBP-α
(C-isoform) and HA-tagged MLX, both in the CMV4 vector, along with a renilla
luciferase control. Following transfection, cells were cultured in DMEM with high 
(25 mM) glucose. Twenty-four hours after transfection, cells were cultured in
DMEM containing low (2.5 mM) versus high (25 mM) glucose. Cells were col- 
lected and firefly luciferase activity was measured 24 h after the media change and 
normalized to renilla luciferase activity (Dual Luciferase Assay System, Promega). 
All experiments were repeated at least three times with comparable results. 
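
The normalization of firefly to renilla luciferase activity can be sketched as follows. The readings and function names are hypothetical, and expressing results as fold over the mean of an empty-vector control is an assumption about presentation, not something stated in the text.

```python
def normalized_luciferase(firefly, renilla):
    """Firefly signal divided by the co-transfected renilla control signal."""
    if renilla <= 0:
        raise ValueError("renilla signal must be positive")
    return firefly / renilla


def fold_over_control(sample_ratios, control_ratios):
    """Express each normalized ratio relative to the mean control ratio."""
    control_mean = sum(control_ratios) / len(control_ratios)
    return [ratio / control_mean for ratio in sample_ratios]


# Hypothetical raw luminescence readings as (firefly, renilla) pairs.
control = [normalized_luciferase(f, r) for f, r in [(1000, 500), (1200, 600)]]
high_glucose = [normalized_luciferase(f, r) for f, r in [(8000, 500), (6000, 500)]]
folds = fold_over_control(high_glucose, control)
print(folds)
```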
Functional analysis of ChREBP-β. Flag-tagged ChREBP-β was generated by
deletion of N-terminal sequence in the mouse Flag-tagged ChREBP-α
(GenBank accession AF245475) in the CMV4 vector by site-directed mutagenesis
(QuickChange II Site-Directed Mutagenesis Kit, Agilent). ChREBP-β sequence was
confirmed by sequencing. Flag-tagged ChREBP-α, Flag-tagged ChREBP-β, or
empty pGL3_basic control were co-transfected with HA-tagged MLX and a
reporter plasmid containing a promoter region consisting of two copies of the
ACC1 ChoRE fused to the firefly luciferase gene. Following transfection, cells
were cultured in DMEM with high (25 mM) glucose. Twenty-four hours after
transfection, cells were cultured in DMEM containing low (2.5 mM) or high
(25 mM) glucose. Cells were collected and firefly luciferase activity was measured 24 h
after the media change and normalized to renilla luciferase activity (Dual
Luciferase Assay System, Promega). Similar results were also obtained when
normalizing luciferase activity to ChREBP-α or ChREBP-β protein levels, respectively
(data not shown). All experiments were repeated at least three times with
comparable results. 

Statistics. All values are given as means ± standard error. Differences between two
groups were assessed using unpaired two-tailed Student’s t-tests unless otherwise 
specified in figure legends. Comparisons between multiple groups were performed 
by ANOVA with pairwise comparisons and adjustment for multiple comparisons 
by the Tukey test unless otherwise specified in figure legends. For analyses with 
non-normal distributions or unequal variances, ANOVA and pairwise com- 
parisons were performed on log-transformed values. All data presented in the 
figures are of non-transformed values. Owing to the large variance and non- 
normal distribution of GLUT4 expression in the obese human cohort (Fig. 3i), 
log mRNA expression levels were analysed and presented. Statistical analyses 
including linear regressions were performed with SPSS 8.0. 
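
A minimal sketch of the summary statistics described above is given here, using only the Python standard library. The analyses in the paper were run in SPSS; the function names and data below are hypothetical. Only the t statistic and degrees of freedom are computed; a P value would additionally need the t distribution (for example from a library routine such as scipy.stats.ttest_ind).

```python
import math
import statistics


def mean_sem(values):
    """Mean and standard error of the mean."""
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return statistics.mean(values), sem


def student_t(a, b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * statistics.variance(a)
              + (nb - 1) * statistics.variance(b)) / df
    standard_error = math.sqrt(pooled * (1 / na + 1 / nb))
    return (statistics.mean(a) - statistics.mean(b)) / standard_error, df


def log_transform(values):
    """Natural-log transform for skewed, strictly positive data."""
    return [math.log(v) for v in values]


# Hypothetical measurements for two groups.
group_a = [1.0, 2.0, 3.0]
group_b = [2.0, 3.0, 4.0]
t_stat, dof = student_t(group_a, group_b)
print(mean_sem(group_a), t_stat, dof)
```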
A unique regulatory phase of DNA
methylation in the early mammalian
embryo
& Alexander Meissner


DNA methylation is highly dynamic during mammalian embryogenesis. It is broadly accepted that the paternal genome 
is actively depleted of 5-methylcytosine at fertilization, followed by passive loss that reaches a minimum at the 
blastocyst stage. However, this model is based on limited data, and so far no base-resolution maps exist to support 
and refine it. Here we generate genome-scale DNA methylation maps in mouse gametes and from the zygote through 
post-implantation. We find that the oocyte already exhibits global hypomethylation, particularly at specific families of 
long interspersed element 1 and long terminal repeat retroelements, which are disparately methylated between gametes 
and have lower methylation values in the zygote than in sperm. Surprisingly, the oocyte contributes a unique set of 
differentially methylated regions (DMRs)—including many CpG island promoters—that are maintained in the early 
embryo but are lost upon specification and absent from somatic cells. In contrast, sperm-contributed DMRs are 
largely intergenic and become hypermethylated after the blastocyst stage. Our data provide a genome-scale, 
base-resolution timeline of DNA methylation in the pre-specified embryo, when this epigenetic modification is most 
dynamic, before returning to the canonical somatic pattern. 


Cytosine methylation in mammals is an epigenetic modification that 
is largely restricted to CpG dinucleotides and serves multiple critical 
functions, including stable repression of target promoters, maintaining 
genomic integrity, establishing parent-specific imprinting patterns, and 
silencing endogenous retrotransposon activity. In somatic tissues,
CpG methylation exhibits global patterns based on relative CpG
density: CpG islands at housekeeping or developmental promoters
are largely unmethylated, whereas non-regulatory CpGs distributed
elsewhere in the genome are largely methylated. This landscape is
relatively static across all somatic tissues, where most of the methylated
CpGs are pre-established and inherited through cell division. Generally,
only a small fraction of CpGs switch their methylation status as part of
an orchestrated regulatory event.

DNA methylation is much more dynamic during mouse germ cell 
and pre-implantation development. The classical model postulates 
that at fertilization, a targeted, although widespread, catalytic process 
actively removes DNA methylation contributed by the paternal 
gamete. Recent evidence implicates a demethylation mechanism that 
transitions through a hydroxymethylated intermediate that is catalysed
by the Tet3 member of the Tet family. However, only a portion
of hydroxylated targets seems to be actively catalysed to complete
demethylation, and the identity of these targets remains unknown.
After fertilization, there appears to be a passive loss of global DNA
methylation levels that continues until the blastocyst stage, where the
inner cell mass (ICM) that gives rise to the embryo proper is first
specified (reviewed by ref. 11). Recent evidence indicates that this
passive depletion may also be facilitated in part by Tet-enzyme-mediated
hydroxylation. After specification of the ICM, the embryo
implants into the uterine lining in concert with gastrulation, which
co-occurs with global remethylation of the genome that is believed to
contribute to lineage restriction and the loss of cellular potency.
Unfortunately, on a quantitative, genome-wide scale, little is 
known about the specific dynamics of cytosine methylation during 
these earliest developmental stages. The classical model is drawn
from observations made using either global measurements, such as
immunohistochemistry, or from limited analysis at individual
loci. Key questions about DNA methylation patterns in early
development remain open, including which genomic features are
specifically targeted, as well as the identities of DMRs inherited from
either gamete beyond known imprint control regions (ICRs). Here we
use genomic high-resolution methylation profiling to gain insight
into the underlying mechanisms and regulatory principles of DNA 
methylation as it functions in early mammalian development. 


Genome-scale methylation maps of murine 
embryogenesis 

To generate a global and high-resolution view of early mammalian 
DNA methylation dynamics, we collected oocytes and sperm, as well 
as zygote, 2-, 4- and 8-cell cleavage stage embryos, the early ICM and 
embryonic day (E)6.5/7.5 post-implantation embryos (Fig. 1a and
Supplementary Figs 1 and 2). All samples were extensively washed
and purified to remove any somatic or gametic contaminants;
maternal biasing from meiotic polar bodies (representing a 1× or
0.5× static genomic contaminant, respectively) was excluded by
mechanical biopsy (Supplementary Fig. 1 and Supplementary
Movie 1) and was further confirmed by assessing the paternal
(129X1/SvJ) to maternal (C57BL/6 × DBA/2) ratio of known single
nucleotide polymorphisms (SNPs) (Supplementary Figs 3 and 4). We 
Figure 1 | Global CpG methylation dynamics across early murine
embryogenesis. a, Samples isolated for methylation analysis with replicate
number (n) highlighted. d.p.f., days post fertilization; h.p.f., hours post
fertilization. b, Fraction of 100-bp tiles with high (≥0.8, red), intermediate
(inter, >0.2 and <0.8, green) and low (≤0.2, blue) methylation values. Brain,
heart and liver tissue are shown for adult comparisons. c, Histogram of 
methylation values across 100-bp tiles. n is the number of tiles for each stage. 


generated reduced representation bisulphite sequencing (RRBS)
libraries from each stage to provide a comprehensive timeline of 
DNA methylation patterns during early mouse embryogenesis. 

Compared to all other genome-wide profiling strategies currently 
available, RRBS is optimally suited for the low cell numbers that can 
be obtained from embryonic samples. Within our range of ~0.5–10 ng
genomic DNA, RRBS provides the expected genomic coverage
and high reproducibility (Supplementary Fig. 2). On average, we 
obtained the methylation status of 1,062,216 CpGs for comparative 
analysis (Supplementary Table 1). Bisulphite sequencing cannot dis- 
tinguish between methyl- and hydroxymethylcytosine (hmC), and 
current methods for global profiling of hmC lack the sensitivity to 
investigate the pre-implantation stages in this study. Thus, we
cannot draw any definitive conclusions regarding the base-resolution
hmC distribution, but this modification has not yet been linked to a
regulatory mechanism other than to potentiate demethylation.
Given this ambiguity, regions of high mC/hmC methylation, espe- 
cially those retained over multiple time points, could still be expected 
to function as if methylated. 


The global embryonic pattern is unique 

Current models postulate a phase of global hypomethylation during 
mammalian pre-implantation development that reaches a minimum 
at the morula/blastocyst stage. However, it is unknown which genomic 
regions are affected or how similar the embryonic methylation pattern 
is to the adult. To address these questions, we investigated the global
dynamics of CpG methylation using 100-base-pair (bp) tiles 
(Methods). We found that oocytes are already globally hypomethy- 
lated compared to sperm (0.32 median methylation in oocyte versus 
0.83 in sperm; Supplementary Fig. 5). We examined the relative proportion
of genomic regions at each stage falling into high (≥0.8),
intermediate (>0.2 and <0.8) or low (≤0.2) methylation categories.
Notably, oocyte methylation levels more closely resemble those of early 
embryonic time points than the levels in sperm, post-implantation 
d, Box plots of methylation values across local CpG densities highlight the
difference between hypomethylated pre-implantation tissues and the somatic
pattern seen in sperm, post-implantation and adult samples. Circle indicates
the median, edges the 25th/75th percentile and whiskers the 2.5th/97.5th
percentile. e, CpG density distribution for >0.2 methylation (left panel)
and <0.2 methylation (right panel) tiles in stages that display somatic versus
embryonic patterning (red and blue lines, respectively).


embryos, or adult tissues (Fig. 1b). We also observed a gradual increase 
in the fraction of tiles that exhibit intermediate and low methylation 
values from oocytes to the early ICM, which is consistent with loss of 
methylation over multiple cleavage divisions (Fig. 1b). 
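
The three methylation categories used for the 100-bp tiles can be expressed as a small classifier. This is a minimal sketch: the thresholds follow the text (high at or above 0.8, low at or below 0.2, intermediate in between), while the function names and tile values are hypothetical.

```python
from collections import Counter


def methylation_class(value):
    """Assign a tile methylation value (0 to 1) to a category."""
    if value >= 0.8:
        return "high"
    if value <= 0.2:
        return "low"
    return "intermediate"


def class_fractions(tile_values):
    """Fraction of tiles falling into each category for one stage."""
    counts = Counter(methylation_class(v) for v in tile_values)
    total = len(tile_values)
    return {name: counts[name] / total
            for name in ("high", "intermediate", "low")}


# Hypothetical tile methylation values for a single sample.
tiles = [0.95, 0.80, 0.50, 0.20, 0.05, 0.10, 0.35, 0.90]
fractions = class_fractions(tiles)
print(fractions)
```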

Sperm and post-implantation embryos show a strong inverse 
relationship between CpG density and methylation levels that is 
characteristic of somatic cells. In oocyte and pre-implantation 
samples, this dependence is weaker (Fig. 1c, d). In both pre- and 
post-implantation embryos, methylated CpGs (>0.2) tend to occur 
in low CpG density regions, as they do in somatic cells (Fig. 1e, left).
However, the alternative relationship between higher CpG density
and low methylation is not as apparent in the oocyte or the pre-implantation
embryo (Fig. 1e, right). In summary, pre-implantation
development represents a unique developmental period where 
methylation is differentially positioned and regulated before being 
restored and maintained in a somatic fashion. 


Two transitions in early development 


We next searched for substantial changes in regional DNA methyla- 
tion through development. For each pair of consecutive stages, we 
compared methylation levels of each tile and classified it as changed if 
the difference exceeded 0.2 and was significant according to a false 
discovery rate (FDR)-corrected t-test. The most dramatic changes in 
DNA methylation occurred during two developmental transitions: 
between sperm and the zygote and between the early ICM and the 
post-implantation embryo (Fig. 2a). At each of these transitions, most 
changes were unidirectional (Fig. 2b): a gross reduction upon 
fertilization (mean = 0.47 decrease for 37% of tiles examined) 
and massive remethylation from the ICM onwards (mean = 0.46 
increase in methylation at 66% of tiles). Within E6.5 and E7.5 post- 
implantation embryos, the methylation levels at most of the assayed 
tiles were stable or increased slightly (Fig. 2b). However, more subtle 
global changes, reflecting a gradual decrease in methylation, were 
observed from zygote/early cleavage through to the 8-cell stage and 
Figure 2 | Major transitions in DNA methylation levels during early
development. a, 100-bp tiles available for pairwise comparison across 
consecutive embryonic stages. Tiles that remain unchanged (stable) at the 
indicated transitions are shown in light blue. Tiles that change by greater than 
0.2 and are significant by t-test are highlighted in dark blue. b, 100-bp tiles with 
increasing (red) or decreasing (green) methylation levels at each consecutive 
transition show that major transitions are largely unidirectional. c, Box plot of 
methylation levels for sperm-specific DMRs (n = 134,038 tiles). Red line 
indicates the median, edges the 25th/75th percentile and whiskers the 2.5th/ 
97.5th percentile. d, Box plot of methylation levels for oocyte-specific DMRs 
(n = 6,394 tiles) as in c. e, Seventy-four CpGs within sperm-specific DMR tiles 
(c) could be ascribed to paternal or maternal alleles and tracked across stages. 
Paternal CpG methylation values (blue line, median; coloured space, 25th/75th 
percentile) decrease by the zygote stage whereas maternal CpG methylation 
(red line, median; coloured space, 25th/75th percentile) remains unchanged. If 
untracked, these CpGs have an intermediate methylation value between those 
ascribed to a parent of origin (black line). 


into the ICM, where methylation levels reached their lowest observed 
values (Fig. 1b, c). 
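
The rule for calling a tile changed between consecutive stages (a difference exceeding 0.2 that is also significant after FDR correction) can be sketched as follows. The text does not name the FDR procedure or cut-off used for this step, so the Benjamini-Hochberg adjustment and the 0.05 threshold below are assumptions; the per-tile P values are taken as given and all example numbers are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


def classify_tiles(differences, p_values, min_change=0.2, fdr=0.05):
    """Label each tile as increased, decreased or stable between two stages."""
    q_values = benjamini_hochberg(p_values)
    labels = []
    for diff, q in zip(differences, q_values):
        if abs(diff) > min_change and q < fdr:
            labels.append("increased" if diff > 0 else "decreased")
        else:
            labels.append("stable")
    return labels


# Hypothetical methylation differences (later minus earlier stage) and P values.
labels = classify_tiles([-0.5, 0.3, -0.1, 0.4], [0.01, 0.04, 0.03, 0.5])
print(labels)
```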


The oocyte defines the early methylation landscape 


Active demethylation is expected to occur before pronuclear fusion or 
the completion of DNA synthesis. When we compare methylation
patterns between sperm and zygote, most regions in the genome show 
reduced methylation in the zygote with few sizeable changes in 2-cell 
embryos (Fig. 2b). Notably, the vast majority of tiles that are methy- 
lated at significantly different levels between gametes show higher 
methylation levels in sperm than in oocyte and many are reduced 
to levels near those of the oocyte (Fig. 2c, d). Using SNPs, we con- 
firmed this observation by tracking 74 CpGs that fell within these tiles 
and could be assigned paternal- or maternal-specific values. Zygotes 
displayed a decrease in paternal methylation in contrast to maternally 
contributed CpGs, which remained unmethylated (Fig. 2e). Zygotes 
isolated here are likely in earlier stages of S phase, such that either a 
passive, replication-based mechanism could result in the synthesis of 
unmethylated, nascent DNA or DNA methylation could be removed 
by a targeted process. The similarity in methylation levels
between zygote and the 2-cell stage, which represents one complete
round of replication, indicates that at least some observed
demethylation is a consequence of targeted removal, but distinguishing
between these two models may be complicated by the coupling of
proposed base-excision repair mechanisms and DNA replication 
itself. 

The few regions that are significantly hypermethylated in oocyte 
compared to sperm exhibit intermediate values in the zygote, suggest- 
ing a more direct inheritance of the allelic methylation state (Fig. 2d). 
The disparity in the zygotic resolution of regions that are differentially 
methylated between the gametes indicates that the oocyte largely
reflects the zygotic/pre-implantation methylome and prescribes its 
architecture (Supplementary Fig. 6). Thus, the oocyte methylome, 
rather than the sperm methylome, seems to be more reflective of 
patterns in the early embryo. 


Retroelement dynamics at fertilization 


Consistent with a demethylation model, we confirmed that the vast 
majority (96%) of tiles that are hypermethylated in sperm in our data 
set become less methylated in the zygote. Most of these tiles already 
exhibit lower methylation in the oocyte, such that additive effects 
could explain more subtle decreases in many regions. Interestingly, 
tiles exhibiting the most extreme methylation changes during the 
sperm to zygote transition are enriched for long interspersed elements 
(LINEs) (P < 4.7 × 10⁻¹⁸⁴, FDR < 0.05, hypergeometric enrichment)
(Fig. 3a and Supplementary Table 2). We directly estimated
the methylation level for individual LINEs surveyed by RRBS at each
stage and found that changes in these elements are markedly bimodal 
during the sperm to zygote transition, with 18% of LINEs reducing 
their methylation values by over 0.45 (Fig. 3a). By comparison, 10% of 
captured long terminal repeat (LTR) retroelements exhibit similar 
levels of demethylation, but the distribution is not as clearly bimodal 
(Fig. 3b and Supplementary Table 3). Short interspersed elements 
(SINEs) are generally less methylated in sperm than other repeat 
classes, and also exhibit shifts in their methylation values from
sperm to early embryo, but without the apparent bimodality observed 
for LINE elements (Supplementary Fig. 7). 
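
A hypergeometric enrichment test of the kind cited above asks how likely it is to see at least the observed number of annotated tiles (for example LINE-overlapping tiles) among the most changed tiles by chance. The sketch below is a generic upper-tail calculation with hypothetical counts and parameter names; it is not the authors' pipeline and does not attempt to reproduce the reported P values.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, drawn, observed):
    """P(X >= observed) when `drawn` tiles are sampled without replacement
    from `population` tiles, of which `annotated` carry the feature."""
    upper = min(annotated, drawn)
    total = comb(population, drawn)
    tail = sum(comb(annotated, k) * comb(population - annotated, drawn - k)
               for k in range(observed, upper + 1))
    return tail / total


# Hypothetical counts: 10 tiles in total, 4 annotated, 3 selected, 2 of them annotated.
p_value = hypergeom_enrichment_p(population=10, annotated=4, drawn=3, observed=2)
print(p_value)
```

Exact integer arithmetic via math.comb avoids underflow in the individual terms, although extremely small tail probabilities may still round to zero when converted to a float.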

Surprisingly, LINEs that changed most dramatically during the 
sperm to zygote transition largely consisted of two closely related 
families of L1 LINEs: L1Md_T and L1Md_Gf (Fig. 3c, d,
P<47x10 '*, P<79X10 °° hypergeometric enrichment 
test). Repeats from these families had the largest and most consistent
decrease, whereas those from other equally represented families,
such as L1Md_A elements, showed smaller changes upon fertilization
and maintained higher methylation values in both oocyte and zygote 
(Fig. 3e and Supplementary Fig. 8). Similarly, several LTR families 
showed considerable loss of methylation within the zygote (Fig. 3f, 
g), whereas the class II intracisternal A-particles (IAPs, Fig. 3h) did 
not. The latter finding is consistent with the known retention of high 
methylation levels of IAPs throughout cleavage.

Interestingly, during early development, all retrotransposons resolved 
identically, reaching minimal values at the ICM stage before increasing 
to more somatic levels by E6.5/7.5 (Fig. 3i). Thus, repeat elements exist in 
a less methylated state primarily in the oocyte pre-implantation stages 
(Supplementary Fig. 7). Bisulphite sequencing cannot address if 
methylated cytosines at these repeats are converted to hmCs before a 
subset is further targeted for complete demethylation. Some mCs may 
be targeted for active demethylation via this intermediate form, whereas 
the remaining mC/hmC residues may lose their methylation more 
passively through cleavage, consistent with recent metaphase immunostaining
results.


Gametes confer distinct features as DMRs 


Although loss of methylation is widespread, some epigenetic informa- 
tion must be differentially contributed by the two gametes, including 
known ICRs that maintain their allele-specific methylation pattern 
throughout embryogenesis. We systematically searched for inherited
DMRs contributed from either gamete by applying linear regression to
all tiles that had mean methylation ≥0.75 in one gamete and <0.25 in
the other. We identified 376 oocyte-contributed DMRs with inter- 
mediate methylation levels in the zygote (P< 0.047, FDR <0.05, 
ANOVA; linear regression residual <0.29, FDR <0.1; Fig. 4a) and 
4,894 sperm-contributed DMRs (Fig. 4c). Notably, oocyte-contributed 
DMRs reside primarily in CpG island-containing promoters (Fig. 4b
and Supplementary Table 4), whereas sperm-contributed DMRs are 
predominantly intergenic (Fig. 4d). The sperm- and oocyte-contributed 
Figure 3 | Specific families of LINE and LTR retroelements exhibit the most
dramatic methylation changes in the sperm to zygote transition.
a, Histogram of the difference in methylation levels, where negative values
represent tiles decreasing from sperm to zygote, within LINE retroelement
features that are captured by RRBS. 85% of the elements have a significant
difference (P < 0.04, FDR <0.05; t-test). The distribution is bimodal with 18%
of elements displaying a change in methylation status ≥0.45 as highlighted in
red. b, Differences in methylation between sperm and zygote within annotated
LTR retroelements. Compared to LINEs, a smaller fraction of elements appear
to be regulated by DNA demethylation (61% significant, 10% of those sampled
exhibiting changes ≥0.45 as highlighted in red). c–e, Box plots of methylation
levels in oocyte, sperm and zygote (top panels) as well as the distributions of 
change in methylation levels between sperm and zygote (bottom panels) for 
specific LINE-1 families, including those that are (c, d) or are not dynamic 
(e). Top panels: the red line indicates the median, edges the 25th/75th percentile 
and whiskers the 2.5th/97.5th percentile. Bottom panels: members of each 
family that are demethylated by greater than 0.45 are highlighted in red. 

f-h, Box plots of methylation levels in oocyte, sperm and zygote (top panels) 
and the distributions of change in methylation levels between sperm and zygote 
(bottom panels) for specific families of LTR retroelements, including 
MMERGIN (f), RLTR10C (g) and IAP elements (h). Top and bottom panels as 
in c-e. i, Mean methylation level for all elements of the L1Md_A LINE (solid 
blue line) and IAP LTR class (solid red line) that do not markedly change 
contrasted by LINEs (dashed blue line) and LTR elements (dashed red line) that 
show the greatest loss at fertilization. SINE elements (green line) are less 
methylated in sperm than other repeat elements and appear to decrease to 
oocyte levels. 


DMRs also differed substantially in their relative CpG densities 
(Supplementary Fig. 9). 
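
The first filtering step for gamete-contributed DMRs, selecting tiles that are highly methylated in one gamete and lowly methylated in the other, can be sketched as below. Only this pre-filter is shown; the subsequent linear regression and significance testing are not reproduced. Tile names and values are hypothetical, and the cut-offs are read from the text as at least 0.75 in one gamete and below 0.25 in the other.

```python
def candidate_dmr(oocyte_mean, sperm_mean, high=0.75, low=0.25):
    """Return which gamete contributes the methylated allele, or None."""
    if oocyte_mean >= high and sperm_mean < low:
        return "oocyte"
    if sperm_mean >= high and oocyte_mean < low:
        return "sperm"
    return None


# Hypothetical mean methylation per tile as (oocyte, sperm).
tiles = {
    "tile_a": (0.90, 0.10),
    "tile_b": (0.05, 0.85),
    "tile_c": (0.60, 0.10),
}
candidates = {name: candidate_dmr(oocyte, sperm)
              for name, (oocyte, sperm) in tiles.items()}
print(candidates)
```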

We next focused specifically on oocyte-contributed promoter 
DMRs, in part due to their unusual enrichment for high CpG- 
containing promoters (HCPs). Although this set had no clear func- 
tional enrichment, it did include the promoters of several interesting 
genes that are not expressed during later stages of oogenesis, such as 
Dnmt3b and the somatic isoform of Dnmt1 (refs 41-43), which sug- 
gests a possible regulatory function for at least some of these DMRs. 
The use of genotyped strains allowed us to confirm that the methyla- 
tion proximal to the CpG island promoter of copine VII (Cpne7), 
another putative DMR, was directly inherited from the oocyte 
Figure 4 | Differentially methylated regions represent discrete gamete-specific
feature classes. a, Heat map of methylation levels (black, 0; red, 1; grey,
missing value) in 376 identified 100-bp tiles (rows) that behave as oocyte- 
contributed DMRs in the zygote. Tiles are sorted by functional classes and 
clustered within each class. Fifteen known ICRs, shown at the bottom, behave 
similarly in the early embryo and retain intermediate methylation through 
implantation. Other includes both Other and No annotation. b, Genomic 
features (top) and promoters of different CpG densities (bottom) in oocyte- 
contributed DMRs. Top: oocyte DMRs are enriched for promoters. Bottom: 
most of the 105 promoters that overlap oocyte-contributed DMR tiles are high 
CpG density promoters containing CpG islands (HCPs, light blue). c, Heat map 
of methylation levels (black, 0; red, 1; grey, missing value) in 4,894 identified 
100-bp tiles (rows) that behave as sperm-contributed DMRs in pre- 
implantation embryos. Tiles are sorted and highlighted as in a. d, Genomic 
features in sperm-contributed DMRs are generally intergenic. 


(Fig. 5a). As a set, oocyte-contributed promoter DMRs retained
intermediate methylation values from the zygote through to the ICM,
followed by resolution to hypomethylation in the specified embryo 
(Fig. 5b, c). Thus, CpG island methylation is transiently stabilized 
during cleavage divisions before re-establishing an unmethylated 
state around implantation. A distinct methylation pattern during 
pre-implantation development is also observed in sperm-contributed 
DMRs, which retain intermediate methylation values through to the 
ICM, before being hypermethylated post-implantation to typical 
somatic levels (Fig. 5d). 

RRBS is designed to enrich for CpG dinucleotides (sixfold), but it also 
captures the other three non-CpG dinucleotides at normal frequencies. 
Of these, CpA is the predominant target for methylation in mouse and 
human. Consistent with previous locus-specific observations, we
found that oocytes had the highest global CpA methylation level 
observed across pre-implantation development, and that this level 
decreased by ~50% in the zygote stage (Fig. 5b and Supplementary 
Fig. 10). This indicates that non-CpG methylation is inherited as part 
of oocyte-contributed methylated alleles but is likely lost more rapidly. 


Discussion 

To better understand the regulation of methylation patterns during its 
most dynamic phase, we generated genome-scale maps of DNA 
methylation in both gametes and through implantation. We find that 
methylation contributed by sperm to the zygote is altered most within 
retroelements of specific families, whereas other elements remain more 
protected and retain higher methylation levels throughout development 
(Supplementary Fig. 11). The methylation status of the oocyte is a strong 
predictor of levels in the zygote, and regions that are already hypomethy- 
lated in the oocyte could explain much of the disparity between the early 
embryo and sperm. The mechanism and targets of DNA demethylation 
Figure 5 | DMRs resolve after cleavage to univalent hyper- or
hypomethylated values in a gamete-of-origin-specific fashion. a, Single CpG 
resolution methylation within 2 kilobases (kb) of the Cpne7 promoter in 
gametes and across embryonic development (rows). Dark grey bar highlights 
the CpG island. A CpG proximal to the island can be tracked to a phase 
resolving SNP and is highlighted in light grey, with paternal (X1) and maternal 
(C57) methylation values included as an inset for each trackable phase. Values 
for SNP methylation in ‘cleavage’ correspond exactly to those captured in the 
zygote. Blue bars, CpG methylation; red bars, CpA methylation. b, Composite 
plot of CpG (blue) and CpA (red) methylation for all HCPs (left) and for 
promoters that are specifically hypermethylated in oocytes (transcription start 
site (TSS) DMRs, right). The region ±2 kb of the TSS is marked in grey.
Identified promoter DMRs contributed by the oocyte are hypermethylated 
around the periphery of the TSS and resolve to intermediate values throughout 
cleavage. An expected HCP methylation architecture is re-acquired for these 
DMRs around implantation. c, Mean methylation levels across stages for 
oocyte-contributed DMRs in promoters (red, dashed line) versus the complete 
set (red, solid line). d, Sperm-contributed DMRs (blue line) generally resolve to 
hypermethylation. 


during female gametogenesis could possibly be similar to those at work 
during fertilization. Regardless, the embryonic pattern more closely
resembles that of the oocyte until the later stages of pre-implantation, 
where DNA methylation is further decreased. 

In addition to classical ICRs, which exhibit parent-of-origin- 
specific methylation maintained through adulthood, a substantial 
number of CpG island promoters are specifically hypermethylated 
in the oocyte, in agreement with a recent study. Surprisingly, these
regions retain intermediate values indicating differential allelic 
methylation before gradually decreasing through ICM specifica- 
tion and gastrulation, where somatic methylation patterns are re- 
established (Supplementary Fig. 11). 

It remains to be investigated whether the diverse targets that exhibit 
low methylation levels during embryogenesis are the consequence of 
a single regulatory principle. LINE and LTR activity in the early 
embryo is associated with some of the earliest transcriptional events 
during zygotic genome activation. Targeted depletion by antisense 
oligonucleotides of the L1Md_T class as well as certain LTRs suggests 
a general requirement for retrotransposon transcription for progression
through cleavage. These observations may also support data
indicating that the elongation factor/histone acetyltransferase ELP3 is
a component of the DNA demethylation machinery and could explain
a tight relationship between complete demethylation and transcription-associated
complexes.

It is likely that current interest in hmC will spur technical improve- 
ments that will permit quantitative dissection of mC and hmC 
patterns, which will help answer remaining questions regarding 
Tet3’s universal necessity for conversion to unmethylated cytosines,
as well as the effect hmC may have on Dnmt-mediated inheritance.
Tet3’s global conversion to hmC of the paternal genome does not 
seem to lead to equivalently marked demethylation on the basis of 
the retention of bisulphite-detected signal. The feature-specific 
dynamics of DNA methylation at fertilization indicate that Tet3 
and hmC may be required for targeted demethylation, as well as for 
driving a gradual depletion through cleavage. Further experiments
will be required to characterize this division-dependent demethyla- 
tion in more detail, and expand it to regions with lower G+C content 
that are under-represented in RRBS. Notably, other mechanisms must 
retain heritable methylation information because many targets dis- 
play relative epigenetic stability from zygote onward and some of 
these features exhibit embryogenesis-specific methylation patterns. 

Our genome-scale single-base-resolution data provide improved 
understanding of the relationship between DNA methylation and early 
development. This expands earlier models derived from immuno- 
histochemistry-based observations and begins to address remaining 
unanswered questions, setting the stage for future epigenetic studies in 
early mouse development. 


METHODS SUMMARY 


Gametes, cultured cleavage stage embryos, immunosurgically dissected E3.5 
ICM, and post-implantation embryos were isolated as described previously (see 
Methods). Samples were purified through sequential KSOM microdrops 
(Millipore) and polar body contaminants mechanically dissected using XY laser 
(HamiltonThorne) assisted biopsy (Supplementary Fig. 1 and Supplementary 
Movie 1). Reduced representation bisulphite libraries were generated from 
proteinase-K-purified, MspI-digested genomic DNA and sequenced using the 
Illumina Genome Analyzer II platform. Sequenced reads were aligned to the 
Mouse Genome Build 37 (mm9) using a custom computational pipeline that 
accounted for the strain identity of samples, which were either inbred or hybrid 
strains to provide adequate SNP tracking. Sampled cytosines covered ≥10× were
used for single CpG analysis. Alternatively, single CpGs were incorporated into
features taken from ref. 4 or into 100-bp tiles using a 5× threshold. The methylation
level reported for a sample is the average across replicates that met these
threshold criteria. Tiles were considered to show a change between two stages
if they exhibited a methylation difference ≥0.2 and statistical significance
through a t-test after correction for multiple hypothesis testing (FDR <0.05)
using the Benjamini-Hochberg method. Retrotransposon annotations are from
the RepeatMasker track of the UCSC genome browser. Novel DMRs were identified
from a pool of 100-bp tiles where one gamete had a mean methylation
≥0.75 and the other had a mean methylation ≤0.25. Linear regression
applied to this set identified tiles that had zygotic methylation values that fell 
halfway between those of oocyte and sperm. SNPs between 129X1/SvJ paternal 
and BDF1 (C57BL/6 × DBA/2) maternal genomes were taken from Mouse
Genome Informatics and used to assess relative maternal contamination as well 
as to identify the parent of origin in order to track allelic methylation values in 
DMRs and sites exhibiting demethylation. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Preparation of samples. Isolation of gametes, pre- and post-implantation 
embryos was performed using procedures described in detail elsewhere (ref. 51).
Briefly, 4-6-week-old BDF1 female mice (Charles River) were injected with
5 IU of pregnant mare gonadotropin (Sigma) followed 46 h later by 5 IU human
chorionic gonadotropin (Sigma). Primed mice were then either directly used to 
collect oocytes or mated with 129X1 male mice (Jackson) to collect fertilized 
embryos. Twelve hours after final hormone injection, oocytes or zygotes were 
isolated from the ampulla under mineral oil and collected in hyaluronidase- 
containing M2 medium (Millipore) drops to eliminate cumulus cells or 
sperm contaminants. Oocytes were then depleted of somatic contaminants via 
progressive dilution through sequential drops of CO2-buffered, amino-acid-supplemented
KSOM medium (Millipore) until no somatic contaminants were
observed. 

Embryos were cultured in KSOM until collection at progressive cleavage stages 
with isolation occurring within 6h of the first observed cleavage event for that 
stage. Zygotes were screened for the presence of visible pronuclei and subjected to 
XY Clone (Hamilton Thorne) laser-assisted polar body biopsy using an 8-µm
bore piezo pipette (Humagen, Supplementary Fig. 1 and Supplementary Movie 1). 
Clean cleavage stage embryos underwent an identical approach, with develop- 
mental progression unhindered by biopsy conducted at the 2-cell stage (Sup- 
plementary Fig. 1). For each collection, batches of embryos were carefully screened 
to ensure that each stage did not contain any abnormal embryos. Collection for 
zygotes was timed at ~10 h.p.f. with fertilization assumed to occur 6-8 h after
HCG injection, which was again confirmed by the relative synchronicity of the first
cleavage division and by relative pronuclear stage. Biopsies were conducted in M2 
media (Millipore) in batches of 5-10 embryos to reduce time on the micro- 
manipulator stage. Before the final collection, cleaned and sorted samples were 
washed with acid Tyrode’s solution (Sigma) to eliminate the zona pellucida and to 
deplete any residual somatic contaminants or polar bodies through a short series of 
additional washes. 

Cavitated E3.5 blastocysts were flushed from the uteri of naturally mated mice 
into M2 or DMEM followed by sequential washing in KSOM. The ICM itself was 
enriched from collected blastocysts by treating the embryo with rabbit anti- 
mouse serum (Sigma) before immunosurgical depletion of the trophectoderm 
using guinea-pig complement serum (Sigma). Isolated ICMs were serially washed 
to remove contaminants. 

E6.5 and 7.5 embryos were isolated after mechanical dissection of the decidua 
from the uterine lining of mated mice. Samples were again serially washed and 
peripheral trophectodermal tissues dissected away using fine glass capillaries. 

Swimming sperm samples were isolated in BSA-supplemented human tubal
fluid (Millipore) from the caudal epididymis of male mice within 5 days of a 
successful natural mating as scored by copulation plug. Sperm were incubated 
in buffered HTF as in in vitro fertilization for over 1h in part to reduce somatic 
contaminants, and samples were scored for relative quality under ×10 microscopy
before snap freezing.

All samples were then collected at minimal volume and either snap frozen or 
immediately re-suspended in DNA lysis buffer. 

Preparation of reduced representation bisulphite sequencing libraries. RRBS 
libraries were generated as described previously. Briefly, DNA was isolated
from snap-frozen embryos in DNA lysis buffer (100 mM Tris-HCl (pH 8.5),
5 mM EDTA, 0.2% SDS, 200 mM NaCl) supplemented with 300 µg ml−1 proteinase
K (Invitrogen) followed by phenol:chloroform extraction, ethanol precipitation and 
re-suspension in EB buffer. Isolated DNA was then subjected to MspI digestion 
(NEB), end repair using Klenow 3′-5′ exo− (NEB) supplemented with GTP,
meCTP and ATP in a 1:1:10 ratio to facilitate 3’ A tailing, and ligation of standard 
adapters using ultraconcentrated 10°U T4 DNA ligase (NEB) and extended 20h 
ligation at 16°C. Size selection of 40-150 and 150-270-bp fragments containing 
ligated adaptor was conducted by extended gel electrophoresis using NuSieve 3:1 
agarose (Lonza) and gel extraction (Qiagen) using 50ng dephosphorylated, 
sonicated Escherichia coli DNA as a protective carrier and to increase overall yield.
The isolated molecular weight fractions in a given RRBS library were then sepa- 
rately treated with sodium bisulphite using the EpiTect bisulphite conversion and 
column purification system (Qiagen) with a modified conversion strategy as 
described. After clean up, the optimal, minimum PCR cycle number required
to generate the final libraries was gauged using diagnostic PCRs for each library. 
Final libraries were then generated from the complete bisulphite converted pool 
and purified through a second round of gel electrophoresis. High- and low- 
molecular-mass fragments were then either sequenced separately or pooled at a 
2:1 ratio by mass to assume an equimolar representation of both size ranges. 
Libraries were then sequenced on an Illumina Genome Analyzer II before align- 
ment and analysis. The sequencing reads were aligned to the Mouse Genome 
Build 37 (mm9) using a custom computational pipeline taking into account the 
strain background for each sample. To supplement our data set we included
sperm replicate 2 from ref. 25 (SRA accession ERP000689). 
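
The 2:1 pooling ratio by mass can be checked with simple arithmetic: for two fractions to contribute equal numbers of molecules, their masses must be in proportion to their mean fragment lengths. The sketch below assumes, purely for illustration, that each fraction's mean length is the midpoint of its size-selection window; the function name is hypothetical and not part of the published protocol.

```python
def mass_ratio_for_equimolar(mean_len_high_bp: float, mean_len_low_bp: float) -> float:
    """Mass of the high fraction per unit mass of the low fraction
    that gives both fractions the same number of molecules."""
    # moles are proportional to mass / length, so equal moles means
    # mass_high / mass_low == length_high / length_low
    return mean_len_high_bp / mean_len_low_bp


# Assumption: the mean fragment length is the midpoint of each window.
high_mid = (150 + 270) / 2  # 150-270-bp fraction
low_mid = (40 + 150) / 2    # 40-150-bp fraction
ratio = mass_ratio_for_equimolar(high_mid, low_mid)
print(f"high:low mass ratio for equimolar pooling = {ratio:.2f}:1")
```

Under this midpoint assumption the ratio comes out slightly above 2, which is in line with the 2:1 ratio stated in the text.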

Estimating methylation levels. The methylation level of each sampled cytosine 
was estimated as the number of reads reporting a C, divided by the total number 
of reads reporting a C or T. Single CpG methylation levels were limited to those 
CpGs that had at least tenfold coverage. For 100-bp tiles, reads for all the CpGs 
that were covered more than fivefold within the tile were pooled and used to 
estimate the methylation level as described for single CpGs. The CpG density for a 
given single CpG is the number of CpGs 50 bp up- and downstream of that CpG. 
The CpG density for a 100-bp tile is the average of the CpG density for all single 
CpGs used to estimate methylation level in the tile. CpA methylation levels were 
estimated in the same way as for CpG methylation. 
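
As an illustration of the estimates described above, the following minimal sketch computes a single-CpG methylation level and a pooled 100-bp tile level from read counts. The data layout (a dictionary of position to C and T read counts) and the function names are assumptions made for this example, not the pipeline used in the study.

```python
from typing import Dict, Optional, Tuple

# position -> (reads reporting C, reads reporting T); layout is hypothetical
Counts = Dict[int, Tuple[int, int]]


def cpg_methylation(c_reads: int, t_reads: int, min_cov: int = 10) -> Optional[float]:
    """Methylation level C / (C + T), or None below tenfold coverage."""
    total = c_reads + t_reads
    if total < min_cov:
        return None
    return c_reads / total


def tile_methylation(counts: Counts, tile_start: int, tile_size: int = 100,
                     min_cov: int = 5) -> Optional[float]:
    """Pool reads over all CpGs in the tile covered more than fivefold."""
    c_sum = t_sum = 0
    for pos, (c, t) in counts.items():
        in_tile = tile_start <= pos < tile_start + tile_size
        if in_tile and (c + t) > min_cov:
            c_sum += c
            t_sum += t
    if c_sum + t_sum == 0:
        return None  # no sufficiently covered CpG in this tile
    return c_sum / (c_sum + t_sum)


# Made-up counts: the CpG at position 90 has only 3 reads and is ignored.
example = {10: (8, 2), 50: (3, 9), 90: (1, 2)}
print(cpg_methylation(8, 2), tile_methylation(example, 0))
```

Pooling reads before dividing weights each CpG by its coverage, which differs from averaging per-CpG levels.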

The methylation level reported for a sample is the average methylation level 
across replicates. A replicate will contribute to the average only if it meets the 
coverage criteria within the replicate. Technical replicates were averaged before 
contributing to the sample average. 
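
The replicate averaging rule can be sketched as follows. The nested-list input, in which each inner list holds the technical replicates of one biological replicate and `None` marks a value that failed the coverage criteria, is an assumed representation chosen for this example.

```python
from statistics import mean
from typing import List, Optional, Sequence


def sample_methylation(
    bio_replicates: Sequence[Sequence[Optional[float]]]
) -> Optional[float]:
    """Average technical replicates first, then average across replicates."""
    per_replicate: List[float] = []
    for technical in bio_replicates:
        passing = [v for v in technical if v is not None]
        if passing:  # a replicate contributes only if it met the coverage criteria
            per_replicate.append(mean(passing))
    return mean(per_replicate) if per_replicate else None


# Replicate 1 has two technical replicates, replicate 2 failed coverage.
level = sample_methylation([[0.2, 0.4], [None], [0.6]])
```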

Genomic features. High-density CpG promoters (HCP), intermediate-density 
CpG promoters (ICP), low-density CpG promoters (LCP), TSS, CpG island, and 
DMR annotations were taken from ref. 4. Promoters are defined as 1 kb up- and 
downstream of the TSS. LINE, LTR and SINE annotations were downloaded from 
the UCSC browser (mm9) RepeatMasker tracks. Gene annotations were down- 
loaded from the UCSC browser (mm9) RefSeq track. In each case, the methyla- 
tion level of an individual feature is estimated by pooling read counts for all CpGs 
within the feature that are covered greater than fivefold, and levels are only 
reported if a feature contains at least 5 CpGs with such coverage (in contrast to 
100-bp tiles where no minimum number of CpGs is required). A tile is annotated 
as a genomic feature if any portion of the tile overlaps with the feature and may be 
annotated by more than one feature (for example, the same tile can be annotated 
as both a promoter and a gene). 
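
The two rules in this paragraph, annotation by any overlap and the minimum of 5 covered CpGs for a feature, can be sketched as below. Coordinates are treated as half-open intervals on a single chromosome, and the feature names and coordinates are invented for illustration.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Interval = Tuple[int, int]  # half-open [start, end); an assumed convention


def annotate_tile(tile: Interval, features: Dict[str, List[Interval]]) -> List[str]:
    """Return every feature class with at least 1 bp of overlap with the tile."""
    start, end = tile
    labels = []
    for name, intervals in features.items():
        if any(start < f_end and f_start < end for f_start, f_end in intervals):
            labels.append(name)
    return labels


def feature_methylation(cpg_counts: Sequence[Tuple[int, int]],
                        min_cov: int = 5, min_cpgs: int = 5) -> Optional[float]:
    """Pool (C, T) counts over CpGs covered more than fivefold;
    report a level only if at least `min_cpgs` such CpGs exist."""
    covered = [(c, t) for c, t in cpg_counts if c + t > min_cov]
    if len(covered) < min_cpgs:
        return None
    c_sum = sum(c for c, _ in covered)
    t_sum = sum(t for _, t in covered)
    return c_sum / (c_sum + t_sum)


# Hypothetical annotations: a promoter spanning 1 kb either side of a TSS at 1,900.
features = {"promoter": [(900, 2900)], "gene": [(1900, 50000)], "LINE": [(60000, 66000)]}
print(annotate_tile((2000, 2100), features))
```

A tile can carry several labels at once, which is why enrichment counts across feature classes need not sum to the number of tiles.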

Identification of tiles with changing methylation levels and their enrichments. 
A tile is considered changing if it both has a methylation difference ≥0.2 between
two stages and is significant in a two-sample t-test with unequal variance after
correction for multiple hypothesis testing (FDR <0.05) using the Benjamini-Hochberg
method (ref. 55). Enrichment P values are from the hypergeometric distribution
where the background is the number of tiles that have a methylation
difference ≥0.2 and are corrected for multiple hypotheses at FDR <0.05, based
on the number of feature sets tested. 
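
A minimal sketch of the selection and enrichment logic is given below. It assumes the per-tile P values from the unequal-variance t-test have already been computed elsewhere, treats the 0.2 difference threshold as inclusive, and uses hypothetical function names; it is an illustration of the Benjamini-Hochberg adjustment and the hypergeometric tail, not the code used for the paper.

```python
from math import comb
from typing import List, Sequence


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def changing_tiles(diffs: Sequence[float], pvalues: Sequence[float],
                   min_diff: float = 0.2, fdr: float = 0.05) -> List[bool]:
    """Flag tiles with a large enough difference AND an adjusted P value below the FDR."""
    adjusted = benjamini_hochberg(pvalues)
    return [abs(d) >= min_diff and q < fdr for d, q in zip(diffs, adjusted)]


def hypergeom_enrichment(k: int, n: int, K: int, N: int) -> float:
    """P(X >= k) when n changing tiles are drawn from N background tiles,
    K of which overlap the feature of interest."""
    upper = min(n, K)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)
```

For example, `changing_tiles([0.5, 0.1, -0.3, 0.25], [0.01, 0.04, 0.03, 0.005])` flags every tile except the second, whose difference is below the threshold even though its adjusted P value passes.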

Identification of enriched retrotransposon families. The same procedure for 
identifying changing tiles was applied to the methylation levels of retrotransposon 
elements to identify changing elements. Enrichment for families was done using 
annotations from the RepeatMasker track of the UCSC genome browser. 

Novel DMR identification. 100-bp tiles where one gamete had a mean methyla- 
tion greater than 0.75 and the other gamete had a mean methylation of less than 
0.25 were flagged as potential DMRs. Linear regression was used to identify tiles 
which had methylation levels in zygote which were halfway between the methyla- 
tion levels in oocyte and sperm. Only tiles that had two replicates present in each 
time point were considered. Residuals were calculated as the mean of the differ- 
ences between the model predictions and the data taking into account missing 
values. ANOVA was used to assign a P value to each tile. A tile was considered a 
novel DMR if it had a residual in the tenth percentile of tiles tested and a signifi- 
cant P value from ANOVA with a Benjamini-Hochberg FDR <0.05. A residual 
in the tenth percentile corresponds to an FDR <0.1 by a permutation test where 
zygote methylation values are shuffled for potential DMR tiles. In the pie charts 
(Fig. 4b, d), the genomic feature that covered the most novel tiles was reported
first and then subtracted from the set before reporting the feature which covered 
the next largest number of tiles. This procedure was repeated until all tiles were 
categorized. The one exception was for oocyte-contributed DMRs where promoters 
were taken out before genes. 
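
The candidate filter and the pie-chart categorization lend themselves to a short sketch. The version below is deliberately simplified: it replaces the linear regression and ANOVA steps with a plain mean absolute deviation from the gamete midpoint, and it does not implement the promoter-before-gene exception. All names and the example tile identifiers are hypothetical.

```python
from typing import Dict, List, Optional, Sequence, Set, Tuple


def is_potential_dmr(oocyte: float, sperm: float,
                     high: float = 0.75, low: float = 0.25) -> bool:
    """One gamete above `high` and the other below `low`."""
    return (oocyte > high and sperm < low) or (sperm > high and oocyte < low)


def midpoint_residual(oocyte: float, sperm: float,
                      zygote_reps: Sequence[Optional[float]]) -> Optional[float]:
    """Mean absolute deviation of zygote replicates from the gamete midpoint,
    skipping missing values (a simplification of the regression residual)."""
    expected = (oocyte + sperm) / 2
    observed = [z for z in zygote_reps if z is not None]
    if not observed:
        return None
    return sum(abs(z - expected) for z in observed) / len(observed)


def greedy_categories(feature_tiles: Dict[str, Set[str]]) -> List[Tuple[str, int]]:
    """Report the feature covering the most tiles, remove those tiles,
    and repeat until every tile is assigned to exactly one feature."""
    remaining = {name: set(tiles) for name, tiles in feature_tiles.items()}
    report = []
    while any(remaining.values()):
        best = max(remaining, key=lambda name: len(remaining[name]))
        taken = remaining.pop(best)
        report.append((best, len(taken)))
        for tiles in remaining.values():
            tiles -= taken
    return report
```

Because each tile is removed once it is counted, the reported numbers sum to the number of distinct tiles, which is what a pie chart requires.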

Identification of SNPs. An initial set of SNPs between 129X1 and BDF1 (C57BL/6
× DBA/2) was taken from Mouse Genome Informatics (ref. 56). The set was filtered
such that SNPs that fell into the following categories were removed: (1) SNPs that 
had inconsistent entries for the same position; (2) SNPs not trackable by RRBS 
(C/T or A/G); (3) SNPs between C57BL/6 and DBA/2; and (4) SNPs that were not 
covered by X1 and BDF1 in an in silico digest. The log odds ratio, log₂[(X1 count
+ 0.01)/(C57 count + 0.01)], was calculated for each SNP that was covered in the
data set (n = 786). SNPs that had at least 10× coverage with an average log odds
ratio across all replicates between −5 and 5 and a sperm X1 log odds ratio greater
than 2 were considered to be of stringent quality (n = 636) and used to assess both 
maternal bias and to serve as a general quality control metric for all libraries 
incorporated into the data set. 
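
The SNP quality filter can be written out as a small sketch. The pseudocount of 0.01 and the thresholds come from the text; applying the tenfold coverage requirement to every library individually is an assumption of this example, as are the function names and the made-up read counts.

```python
from math import log2
from statistics import mean
from typing import Sequence, Tuple

PSEUDOCOUNT = 0.01  # avoids division by zero and log of zero


def log_odds(x1_count: int, c57_count: int) -> float:
    """log2 of (X1 reads + 0.01) over (C57 reads + 0.01)."""
    return log2((x1_count + PSEUDOCOUNT) / (c57_count + PSEUDOCOUNT))


def is_stringent(replicate_counts: Sequence[Tuple[int, int]],
                 sperm_counts: Tuple[int, int], min_cov: int = 10) -> bool:
    """replicate_counts holds (X1, C57) reads per library;
    sperm_counts holds (X1, C57) reads in the paternal sperm sample."""
    # Assumption: every library must reach the coverage cutoff on its own.
    if any(x1 + c57 < min_cov for x1, c57 in replicate_counts):
        return False
    average = mean(log_odds(x1, c57) for x1, c57 in replicate_counts)
    return -5 < average < 5 and log_odds(*sperm_counts) > 2


# A SNP read almost only as X1 in sperm and roughly balanced in embryos passes.
print(is_stringent([(12, 10), (8, 9)], sperm_counts=(20, 0)))
```

The sperm criterion checks that the paternal allele call is reliable, while the bound on the average excludes SNPs that look monoallelic across all libraries.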

Parent-of-origin methylation tracking. The 636 SNPs identified above corre- 
sponded to 1,674 CpG dinucleotides and were used to track allelic single CpG 
methylation. Reads were segregated into either X1 or BDF1 backgrounds according 
to SNP type, and CpG methylation levels were called in the same manner described
above. SNP normalized methylation values (Supplementary Fig. 4) are the average
of the methylation values from each strain.

51. Nagy, A. Manipulating the Mouse Embryo: a Laboratory Manual 3rd edn (Cold Spring Harbor Laboratory Press, 2003).
53. Gu, H. et al. Preparation of reduced representation bisulfite sequencing libraries for genome-scale DNA methylation profiling. Nature Protocols 6, 468-481 (2011).
54. Gu, H. et al. Genome-scale DNA methylation mapping of clinical samples at single-nucleotide resolution. Nature Methods 7, 133-136 (2010).
55. Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289-300 (1995).
56. Blake, J. A., Bult, C. J., Kadin, J. A., Richardson, J. E. & Eppig, J. T. The Mouse Genome

The 2.8 Å crystal structure of the dynein motor domain


Takahide Kon, Takuji Oyama, Rieko Shimo-Kon, Kenji Imamula, Tomohiro Shima, Kazuo Sutoh & Genji Kurisu


Dyneins are microtubule-based AAA+ motor complexes that power ciliary beating, cell division, cell migration and
intracellular transport. Here we report the most complete structure obtained so far, to our knowledge, of the 380-kDa
motor domain of Dictyostelium discoideum cytoplasmic dynein at 2.8 Å resolution; the data are reliable enough to discuss
the structure and mechanism at the level of individual amino acid residues. Features that can be clearly visualized at this
resolution include the coordination of ADP in each of four distinct nucleotide-binding sites in the ring-shaped AAA+
ATPase unit, a newly identified interaction interface between the ring and mechanical linker, and junctional structures 
between the ring and microtubule-binding stalk, all of which should be critical for the mechanism of dynein motility. We 
also identify a long-range allosteric communication pathway between the primary ATPase and the microtubule-binding 
sites. Our work provides a framework for understanding the mechanism of dynein-based motility. 


Dyneins are large microtubule-based motor complexes responsible for
various biological movements. Cytoplasmic dyneins power diverse
cellular processes, such as cell division, cell migration and minus-end-directed
intracellular transport of vesicles and organelles along
microtubules. Axonemal dyneins are specialized to drive ciliary
and flagellar beating.

Unlike the other families of cytoskeletal motor proteins (myosin and
kinesin, the heavy chains of which are evolutionarily close to the G
protein families), the dynein heavy chains (molecular mass >500 kDa)
belong to the AAA+ superfamily of mechanochemical enzymes.
Members of this ATPase family participate in diverse cellular functions
including protein degradation, protein-complex disassembly, and DNA
replication and recombination. Nevertheless, AAA+ proteins share
structural features: they contain conserved ATPase units called AAA+
modules, and they operate as oligomers in which typically six AAA+
modules assemble into a ring structure. Although in dynein six distinct
AAA+ modules (AAA1-AAA6) are concatenated into a single polypeptide,
their overall arrangement is similar to that observed in other ring-shaped
AAA+ proteins. Therefore, dynein probably shares common
mechanistic features with AAA+ ring ATPases rather than with the
G-protein-related motors. In addition to the ATP-hydrolysing AAA+
ring, the dynein heavy chain contains two major functional units: the
stalk and linker. The stalk has an ATP-sensitive microtubule-binding site
(MTBD) at its tip, whereas the linker has been proposed to function
as a mechanical element for generating dynein’s power stroke.

The molecular mechanism of action of dynein is still poorly understood
compared with that of other cytoskeletal motors and major
AAA+ proteins, which is partly due to the lack of high-resolution
structural information. Most recently, two crystal structures of the
motor domain of cytoplasmic dynein were reported at 4.5-6 Å resolution,
revealing the dynein structures at the secondary structure
level. However, a much higher-resolution analysis is required to reveal
the complete atomic structure of the motor domain, which is critical
for understanding the structural basis of dynein’s force generation
and movement. Here we report the crystal structure of the motor
domain of Dictyostelium cytoplasmic dynein at 2.8 Å resolution, with
approximately 90% of its amino acid residues traced. 


Overall structure of dynein motor domain 


We crystallized the wild type and a truncation mutant (ΔMTBD) of
the 380-kDa motor domain (residues V1388-I4730 of the heavy
chain) of Dictyostelium discoideum cytoplasmic dynein (Fig. 1a).
ΔMTBD lacks the small MTBD (residues A3372-K3495), but retains
ATPase activity (111 ± 6 s−1) comparable to the maximal rate (kcat)
of microtubule-activated ATPase in the wild-type motor domain.
For ΔMTBD, the structure was solved using three heavy-atom derivatives.
The resulting electron-density maps were of sufficient quality
(Supplementary Fig. 1a-f) to create a nearly complete atomic model
(Fig. 1b), except for the 22 amino-terminal residues, disordered loops,
and truncated MTBD (Supplementary Methods). The model was
refined to 2.8 Å resolution (Rwork/Rfree of 26.3/32.1%; Supplementary
Table 1). For wild-type dynein motor domain, the structure was
solved by molecular replacement using the ΔMTBD structure as a
search model, and the model including MTBD was refined to 3.8 Å
resolution (Rwork/Rfree of 22.1/29.4%; Fig. 1c and Supplementary Table
1). In both structures, the crystallographic asymmetric unit has two
motor molecules (ΔMTBD: ΔMTBD-A and ΔMTBD-B; wild type:
WT-A and WT-B; Supplementary Fig. 1g, h). The four molecules
are similar in overall structure, but have several distinct differences
as described below, indicative of dynamic structural changes of the
motor domain. Here we describe the monomeric motor domain structure
by referring mainly to ΔMTBD-A, because it provides the most
complete model.

The motor domain comprises the AAA+ ring and three additional
structural units: the linker, stalk/strut and C sequence (Fig. 1b, c and
Supplementary Fig. 2). The central ATP-hydrolysing ring has six
AAA+ modules arranged around the ring central pore, and is 130 Å
in diameter, 65 Å in thickness and 35 Å in the pore diameter (Supplementary
Fig. 3a). Each AAA+ module contains the N-terminal
α/β-submodule and the carboxy-terminal α-submodule (Supplementary
Fig. 4) that constitute the front and back faces of the ring, respectively
(Supplementary Fig. 3a). The α/β-submodule core exhibits a
Rossmann-type fold (Supplementary Fig. 4a), which is commonly
found in nucleotide-binding proteins. AAA1-AAA4 have unique
insertions in their α/β-submodules: H2, pre-Walker B and pre-sensor
Figure 1 | Overall structure of the dynein motor domain. a, Sequence
diagrams of dynein heavy chain and wild-type and ΔMTBD motor domains,
showing tail, linker and six AAA+ modules constituting ring, stalk-strut and


I (PS-I) inserts (Supplementary Fig. 4b). AAA5 contains an additional 
globular structure (named here the AAA5 extension) that contains
eight α-helices (H9-H16) and protrudes from the back face of the ring
(Supplementary Fig. 3a). 

Above the front face of the ring, the linker exists as a rod-like
structure that comprises nineteen α-helices and eight β-strands and
can be partitioned into five subdomains from 0 to 4 (Supplementary
Fig. 3b). This structure extends from AAA1-α/β above the ring to the
periphery of AAA4 (Fig. 1b, c). Between AAA4 and AAA5, two
prominent coiled coils protrude and form a Y-shaped structure
(Fig. 1b, c and Supplementary Fig. 3c). The longer coiled coil extending
from AAA4 is the stalk and the shorter one extending from AAA5
is the strut (the buttress) that was identified in previous X-ray studies.
The WT-A stalk includes MTBD at the tip, whereas in the WT-B stalk,
MTBD and its distal portion (K3359-K3512) are missing, probably
due to structural flexibility of the stalk. On the ring’s back face, the C
sequence exists as a complex structure comprising six α-helices and an
incomplete six-stranded antiparallel β-barrel (Supplementary Fig. 3d).
The shape of the C sequence is distinctively flat, spreading over the
α-submodules of AAA1, AAA5 and AAA6 (Fig. 1b).

This overall structure is similar at the secondary structure level to
the medium-resolution structures (see Supplementary Fig. 5 for
comparison with our previous 4.5 Å model, which corrects errors in
assignment in the previous study). However, unlike previous models,
our structure is reliable enough to discuss the dynein structure and
mechanism at the level of individual amino acid residues, as judged
from the reasonable refinement statistics at 2.8 Å resolution and resulting
electron-density maps.


The four nucleotide-binding sites 


The motor domain has four ADP molecules bound to the first four 
AAA+ modules (Fig. 2a), in agreement with a previous biochemical
study. Each nucleotide-binding site is formed at the interface


C-sequence. The colour code shown here is used for all figures, unless otherwise 
noted. b, c, Ribbon representation of ΔMTBD (b) and wild-type (c) motor
domain structures. 


between three submodules, α/β and α of one module and α/β of the
adjacent module (Fig. 2b-e and Supplementary Fig. 6), with a structural
organization very similar to that of other AAA+ ATPases.
At the AAA1 nucleotide-binding site (Fig. 2b), the phosphate moiety
of ADP is surrounded by the Walker-A motif (G1977-T1981) in
AAA1-α/β, whereas the ribose moiety is recognized by the sensor-II
motif (R2150) and neighbouring residues (L2149 and K2153) in
AAA1-α. The adenine base is accommodated by hydrophobic residues
in AAA1-α (I2108) and the N loop (L1947 and V1948) connecting
the linker with the AAA1 ATPase site (Fig. 2a, b and Supplementary
Fig. 6a). Given that ATP is the only energy source for dynein motor
function although dynein can hydrolyse other nucleotides, the adenine
base recognition by the N loop is probably involved in the coupling of
the linker actions and ATPase. Residues in Walker-B (D2026 and
E2027) and sensor-I (N2078) motifs of AAA1-α/β, which are generally
required for ATP hydrolysis, occupy the nucleotide-binding site. In
addition, an arginine finger (R2410) crucial for ATP hydrolysis
stretches from the adjacent AAA2-α/β and points to the bound ADP.

Among the AAA1-AAA4 nucleotide-binding sites with similar
structural frameworks, AAA2 shows several distinct characteristics
(Fig. 2c). First, the crucial sensor-II motif is missing and L2531 occupies
the position. Second, in the Walker-B motif, the conserved glutamic
acid essential for ATP hydrolysis is replaced by glycine (G2357).
Moreover, two arginine fingers (R2806 and R2809) from the adjacent
AAA3 module interact extensively with the phosphates of bound ADP.
The unique nucleotide-binding mode of AAA2 indicates that this site
binds ADP and/or ATP but does not hydrolyse ATP. In contrast to
AAA2, most functional motifs of AAA3 and AAA4 required for ATP
binding and hydrolysis are oriented in a canonical manner (Fig. 2d, e),
indicating that dynein has three ATP hydrolysis sites: AAA1, AAA3
and AAA4. This was further supported by pre-steady-state kinetic analysis
of phosphate release from the motor domain carrying multiple
mutations in the three nucleotide-binding sites (Supplementary Fig. 7
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Figure 2 | Structure of the four nucleotide-binding sites. a, Locations of 
bound ADP in the ring. ADP molecules are shown as space-filled models. The 
right panel shows the bottom view of the ring and linker. The N loop connects 
the AAAI nucleotide-binding site and linker. b-e, Magnified views of ADP in 
the nucleotide-binding sites. ATPase motifs, that is, Walker-A, Walker-B, 
sensor-I, sensor-II and arginine-finger, are abbreviated as WA, WB, SI, SII and 
RF, respectively. Residues that are hydrogen-bonded to ADP are shown. In 
panel b, residues participating in ribose moiety (L2149) and adenine ring 
(12108 and L1947) accommodation are included. In panel c, L2531 occupying 
the AAA2 sensor-II position is shown. Other residues forming hydrophobic 
interactions with ADP are omitted for clarity. Hydrogen bonds to ADP are 


drawn with grey dotted lines. Distances between the arginine finger Nn and 
ADP £-phosphate are indicated by black dashed lines. 


and Supplementary Table 2). Similar kinetic experiments performed 
with microtubules showed that among the three ATPase sites, AAAI 
alone drives microtubule-activated ATPase essential for dynein func- 
tion (Supplementary Fig. 7 and Supplementary Table 2). 


Open and closed ring conformations 


The ring structure of ΔMTBD-A displays a unique arrangement of six
AAA+ modules (upper, Supplementary Fig. 8a). The six α-submodules
are almost equally arranged in pseudo-six-fold symmetry. In
contrast, the six α/β-submodules exhibit an asymmetrical arrangement
and can be grouped into AAA5-AAA6-AAA1, forming one
loosely packed half-ring structure with buried inter-modular surfaces
of approximately 200 Å², and AAA2-AAA3-AAA4, comprising the
more tightly packed half-ring structure with buried inter-modular
surfaces of 500-1,000 Å². There is no direct contact between the
two half-rings, and, as a result, apparent gaps are formed between them.
This asymmetrical arrangement is different from that previously
reported for the yeast motor domain in a no-nucleotide state, in which
gaps exist between AAA1-AAA2 and between AAA5-AAA6 instead.

The AAA1-AAA2 and AAA4-AAA5 gaps lie in the AAA1 and
AAA4 nucleotide-binding sites, respectively, affecting the positioning
of the trans-acting arginine finger critical to ATP hydrolysis in many
ATPases. With these open gaps, the arginine fingers extending
from AAA2 to AAA1 (R2410) and from AAA5 to AAA4 (R3780)
are positioned too far away from the bound ADP (Fig. 2b, e and
Supplementary Fig. 8a) to allow ATP hydrolysis when ATP is bound to
these nucleotide-binding sites. Given that ATP is hydrolysed at AAA1
and AAA4, the gaps are expected to close during their ATPase cycles so
that arginine fingers can interact with the γ-phosphate of bound ATP.
Indeed, the WT-A ring structure has a more closed conformation
(lower, Supplementary Fig. 8a). This closure shortens the distance
between the arginine finger and bound ADP (Supplementary Fig. 8a, b),
which would allow R2410 and R3780 to participate in ATP hydrolysis.
Although ΔMTBD and the wild-type dynein motor domain are in the
same nucleotide-bound state, the structural differences suggest the
existence of an ATPase-dependent open/closed transition of the ring,
as observed for other AAA+ ATPases during their mechanochemical
actions.


Linker and ring interactions 


To understand the mechanism of dynein motility, it is critical to
elucidate the structural details of interactions between the
ATP-hydrolysing ring and the linker whose swing-like motions have been
proposed to be the major contributor to dynein's power stroke.
In the present structure, the linker position is close to that of the
post-power-stroke state inferred from electron microscopy analysis.
When viewed from the ring's front face, the linker appears to bridge
AAA1 and AAA4 by spanning the inner edge of AAA2, AAA3 and
AAA4 (Fig. 1b, c). However, the linker interacts with the ring only
through two major interfaces at AAA1-AAA6 and AAA2 (Fig. 3a).
The first interaction interface is at the linker base connecting to
AAA1-α/β (orange in Fig. 3a). Here, linker subdomains 3 and 4 are
tightly sandwiched between AAA1-α/β, AAA1-α and AAA6-α
through four hydrogen bonds and hydrophobic interactions involving
64 residues. The resulting buried surfaces are sufficiently wide
(2,478 Å²) for stable interactions between the linker base and ring.
The second interface is at the middle region of the linker (cyan in
Fig. 3a). Here, two finger-like structures protruding from AAA2, the
H2 insert β-hairpin and the PS-I insert loop, interact with linker
subdomains 2 and 3, respectively (Fig. 3b and Supplementary Fig. 9):
the H2 insert tip forms hydrophobic interactions with β1, β2 and
the H7 helix in linker subdomain 2, and the PS-I insert tip forms a
hydrophobic interaction with the H13 helix in linker subdomain 3.
Several hydrophobic interactions are formed between H2 and PS-I
inserts, indicating that these two finger-like structures function
together to support the linker structure.

To examine the functional significance of H2 and PS-I inserts, we
performed functional analysis of a mutant (ΔH2I) in which the H2
insert β-hairpin (S2319-D2329) was replaced with a flexible linker
(GG). With this mutation, all H2 insert residues interacting with the
linker and PS-I insert were deleted or replaced (Fig. 3b). ΔH2I exhibited
clear microtubule-activated ATPase activity and ATP-induced change
in the affinity for microtubules, although at lower levels than those in
wild type (Fig. 3c and Supplementary Fig. 10a). However, steady-state
fluorescence resonance energy transfer (FRET)-based analysis did not
detect ATPase-dependent linker motions in the mutant (Fig. 3d),
indicating the critical involvement of H2/PS-I inserts in linker swing
actions. The FRET signal reporting the linker position suggests that
the linker may adopt an intermediate position between unprimed
(post-power-stroke) and primed (pre-power-stroke) positions regardless
of the nucleotide state.
Figure 3 | Linker-ring interactions. a, Open-book presentation of
linker-ring interactions in ΔMTBD-A. First and second interaction interfaces are
surrounded by dotted circles in orange and cyan, respectively. The secondary
structure elements involved in the interactions are shown in cartoon
representation. b, Side view of the motor domain showing linker-AAA2
interactions in ΔMTBD-A. Interacting residues between linker subdomain 2/3
and H2/PS-I inserts and between H2 and PS-I inserts are shown. D2329 and
E2383 form a hydrogen bond network through R2337 (not shown). Truncation
positions (S2319 and D2329) in ΔH2I are indicated. c, Microtubule-activated
ATPase rates of the motor domain. Each symbol indicates mean ± s.d. (n = 3).
d, FRET efficiency between BFP and GFP moieties in the motor domain in the
presence of the indicated nucleotide. Vi, vanadate. The high and low FRET of
wild type represent linker positions at primed and unprimed states,
respectively. Mean ± s.d. (n = 3) are shown.


The conventional linker swing model assumes that in the unprimed
state, the N-terminal region of the linker docks on AAA4 or AAA5
with its C-terminal base connected to AAA1, and that ATPase-induced
structural changes propagate from AAA1 to AAA4-AAA5
to release the docked linker end and induce the linker swing.
Here we propose an alternative model, on the basis of our findings
that there is no direct contact between the linker and AAA3-AAA5 in
our structure (Fig. 3b and Supplementary Fig. 9) and that H2 and PS-I
inserts of AAA2 have a critical role in linker swing (Fig. 3d). Given
that H2 and PS-I inserts in other AAA+ proteins serve as interaction
interfaces with target proteins or DNA to catch and remodel the
targets, dynein could also use this remodelling apparatus for the
linker swing. During the mechanochemical cycle of dynein, H2 and
PS-I inserts interacting with linker subdomains 2 and 3 could remodel
the junction of the two subdomains so that the linker bends there and
swings against the ring. In support of the model, the subdomain
junction is the thinnest part of the linker (Fig. 3a, b) and therefore
is the best candidate for the bending site among subdomain junctions
along the linker. The open-closed transition of the AAA1-AAA2
interface during the primary AAA1 ATPase cycle (Supplementary
Fig. 8) may directly affect the position and/or orientation of H2 and
PS-I inserts protruding from AAA2 and thereby may drive the linker
swing.

Figure 4 | Structure and function of stalk-strut and C sequence. a,
Knobs-into-holes side-chain packing in stalk-strut coiled coils of ΔMTBD-A and
ΔMTBD-B. Side chains of residues, assigned as knobs, are shown as red
space-filled models. b, Open-book presentation of stalk-strut interactions in
ΔMTBD-A. Residues involved in interactions are shown. Truncation positions
(Q3824 and E3864) in ΔSD are indicated. c, Microtubule-activated ATPase
rates. Each symbol indicates mean ± s.d. (n = 3). The grey dotted lines indicate
wild-type rates. d, Microtubule-binding activity with or without ATP. The
smooth curves are the best fits to hyperbolas; wild type (dotted lines) exhibited
weak (Kd >10 μM) and strong (Kd ~0.4 μM) binding to microtubules in the
presence and absence of ATP, respectively, whereas ΔSD and ΔC-seq showed
weak MT binding (Kd >10 μM) regardless of the presence of ATP. e, Back side
view of the motor domain, showing the C-sequence H1 helix that interacts with
the H6 helix of AAA5 connected to the strut.


The allosteric communication pathway 


Another key question regarding the dynein mechanism is how the
motor coordinates microtubule binding at MTBD with the primary
ATPase at AAA1, which must be communicated over a long distance
through several structural units including the stalk coiled coil. In the
present structures, the stalk-strut exhibits the most significant
structural variation between the four molecules. ΔMTBD-A has a straight
stalk made of a canonical antiparallel coiled coil along its entire length
and a strut without canonical coiled-coil packing (left, Fig. 4a and
Supplementary Fig. 11). In contrast, ΔMTBD-B possesses a kinked
stalk with a shorter coiled-coil region and a strut with canonical
coiled-coil packing (right, Fig. 4a and Supplementary Fig. 11). In
WT-A, the distal portion of the inward helix of the stalk appears to
melt just below the junction to MTBD, in contrast to the fully folded
stalk in ΔMTBD (Supplementary Fig. 12). These structural differences
suggest that the stalk-strut structure has a high degree of
molecular plasticity that may be relevant to its function of relaying
two-way allosteric communication between MTBD and the ring: the
structural information could be propagated through the local melting
and/or tilting of the stalk, in addition to the recently proposed
'helix-sliding mechanism'.

The distal end of the strut (residues 3831-3862) interacts with the
centre of the stalk (residues 3284-3302 and 3556-3574) to form a
four-helix bundle that is stabilized by three hydrogen bonds and
hydrophobic interactions involving 29 residues with a wide
interaction interface of approximately 800 Å² (Fig. 4b). The extensive
interactions suggest that the strut has a key role in the stalk-mediated
allosteric communication. To examine this, we performed functional
analysis of a strut mutant (ΔSD) generated by replacing its distal
portion (Q3824-E3864) with a flexible linker (GGGG). With this
mutation, the interaction interfaces of the strut with the stalk were
completely removed (Fig. 4b). This mutant showed
microtubule-independent, high ATPase activity that is relatively close
to the kcat value of wild-type microtubule-activated ATPase activity
(~90 s⁻¹; Fig. 4c) and exhibited an ATP-insensitive, weak
microtubule-binding affinity (dissociation constant (Kd) >10 μM;
Fig. 4d). The results indicate that the communication between MTBD and
the AAA1 ATPase site was disrupted by the mutation, and highlight the
importance of stalk-strut interactions for allosteric communication.
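
The dissociation constants quoted for microtubule binding come from hyperbolic fits of co-sedimentation data (Fig. 4d). As a rough illustration of what such a fit involves, and not a reproduction of the authors' procedure, the sketch below fits the model bound = B_max [MT] / (K_d + [MT]) by scanning K_d over a grid and solving B_max in closed form. The function names, the grid and the noise-free synthetic data are all assumptions made for the example.

```python
def hyperbola(mt, b_max, k_d):
    """Fraction of motor bound at microtubule concentration mt (same units as k_d)."""
    return b_max * mt / (k_d + mt)


def fit_hyperbola(mt_conc, bound, kd_grid):
    """Least-squares fit: scan k_d over kd_grid and solve b_max analytically."""
    best = None
    for k_d in kd_grid:
        shape = [mt / (k_d + mt) for mt in mt_conc]
        # For a fixed k_d the model is linear in b_max, so b_max has a closed form.
        b_max = sum(s * y for s, y in zip(shape, bound)) / sum(s * s for s in shape)
        sse = sum((y - b_max * s) ** 2 for s, y in zip(shape, bound))
        if best is None or sse < best[0]:
            best = (sse, b_max, k_d)
    return best[1], best[2]


# Synthetic, noise-free data generated with K_d = 0.4 (illustrative values only).
mt_conc = [0.1, 0.2, 0.5, 1.0, 2.0, 5.0, 10.0]
bound = [hyperbola(mt, 1.0, 0.4) for mt in mt_conc]
kd_grid = [i / 100 for i in range(1, 2001)]  # 0.01 ... 20.00
b_max_fit, k_d_fit = fit_hyperbola(mt_conc, bound, kd_grid)
print(round(b_max_fit, 3), round(k_d_fit, 2))
```

With real data, a binding curve that never approaches saturation within the tested concentration range only gives a lower bound on K_d, which is presumably why the weak-binding cases are reported as ">10 μM" rather than as a fitted value.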

We next examined how structural changes propagate from the
stalk-strut to the primary AAA1 ATPase site. We previously proposed
involvement of the C sequence. To test this and to identify
the functionally crucial C-sequence segment, we designed a deletion
mutant (ΔC-seq) lacking most of the C sequence (S4416-I4730) based
on the present high-resolution structure (Fig. 4e and Supplementary
Fig. 13). This mutant showed microtubule-independent, low ATPase
activity close to the wild-type basal rate and ATP-insensitive, weak
microtubule-binding affinity (Fig. 4c, d and Supplementary Fig. 10b),
illustrating the importance of the C sequence for allosteric
communication. Together with our previous finding that a C-terminal
segment of the C sequence (S4448-I4730) is dispensable for dynein's
motor activity, the results indicate that the remaining H1 helix
(E4412-D4439) is crucial for communication (Fig. 4e and Supplementary
Fig. 13). Indeed, the H1 helix forms a structural bridge between the
AAA1-α/β-AAA6-α segment and the basal portion of the strut
(Fig. 4e and Supplementary Fig. 13): this helix extends from AAA6-α
adjacent to AAA1-α/β, forms a sandwiched structure with AAA5-α
and AAA5 extension, and makes direct contacts with the H6 helix of
AAA5 from which the strut extends towards the stalk. Collectively,
our structure and functional analyses suggest that the stalk, strut and
C sequence are involved in the allosteric communication pathway
between MTBD and the AAA1 ATPase site.

Figure 5 | Proposed mechanism underlying dynein's motor actions. On the
front face of the ring (left), the linker swing for dynein's power stroke is driven
by a mechanism similar to remodelling actions observed in other AAA+
ATPases. The primary AAA1 ATPase cycle-driven open-closed transitions of
the AAA1-AAA2 interfaces affect the position and/or orientation of H2 and
PS-I inserts of AAA2; the structural changes then induce linker bending
motions, possibly around the subdomains 2/3 junction, leading to the linker
swinging against the ring. On the ring's back face (right), the long-range
two-way allosteric communication between AAA1 and MTBD is relayed through
dynein-specific structural units (C-sequence, strut and stalk) to couple the
primary ATPase cycle with regulation of microtubule-binding affinity.


Concluding remarks 


The dynein motor domain structure at 2.8 Å resolution reveals the
unique composite architecture of the AAA+ ATPase ring and
dynein-specific force-generating units (linker, stalk-strut and
C-sequence). It also provides new insights into how dynein harnesses
the unique architecture to generate force and movement along
microtubules. We propose that dynein uses the two distinct molecular
machineries embedded on the different ring faces to execute the two
fundamental motor actions required for dynein motility: driving of the
linker swing for the power stroke and allosteric regulation of MTBD for
modulation of microtubule-binding affinity (Fig. 5). On the ring's
front face, the linker swing could be driven by mechanisms similar to
those of remodelling actions in other AAA+ ATPases through the
ATPase-induced open-closed transition of the interfaces between the
adjacent AAA1-AAA2 modules and the critical participation of the
remodelling apparatus, H2 and PS-I inserts of AAA2. On the ring's back
face, the long-range allosteric communication between AAA1 and MTBD
required for coupling of the ATPase cycle with the switching of
microtubule affinity could be relayed through dynein-specific
structures, that is, stalk, strut and C sequence. The general AAA+
ATPase-type and dynein-specific actions coordinated by the primary
ATPase cycle at AAA1 would drive dynein motility.


METHODS SUMMARY 


For crystallization, we used HF380 in which a His6-Flag tag was attached to the 5′
terminus of the 380-kDa gene. To create ΔMTBD, the MTBD coding region
(A3372-K3495) of HF380 was replaced by a linker sequence (encoding TG).
To produce ΔH2I and ΔSD, H2I (S2319-D2329) and strut (Q3824-E3864) coding
regions were replaced by a linker sequence (encoding GG and GGGG for
ΔH2I and ΔSD, respectively). To construct ΔC-seq, the 3′-terminal region
encoding S4416-I4730 was removed. Expression and purification of the motor
domains were performed as described.

Crystallization of the motor domain was performed essentially as described.
All X-ray diffraction data were collected at 100 K on the beamline BL44XU of
SPring-8, Harima, Japan. Phase calculations proceeded successfully with the
autoSHARP pipeline. An atomic model of ΔMTBD was built on the
density-modified electron-density map and refined at 2.8 Å resolution. The wild-type
structure was determined using the molecular replacement method with the
model of ΔMTBD. MTBD of chain A was clearly seen in the electron-density
map, and a homology model of MTBD was located in the density. The atomic
coordinates of wild type were further refined by CNS at 3.8 Å resolution with the
assistance of deformable elastic network (DEN) restraints.

ATPase rates, microtubule binding and FRET were measured as
described. Phosphate release was measured using a PBP assay with an
SX-18MV stopped-flow apparatus (Applied Photophysics).


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Protein engineering, expression and purification. All constructs were generated
using standard polymerase chain reaction (PCR)-based cloning methods. We used
the 380-kDa motor domain (V1388-I4730; the numbering is based on
DDB0185096 in dictyBase and is different from the numbering we used
previously) of the cytoplasmic dynein heavy chain isolated from D. discoideum Ax2
strain. For crystallization, we used the HF380 construct in which a His6-Flag tag
(encoding MTRHHHHHHGGGDYKDDDDKGGGK) was attached to the 5′
terminus of the 380-kDa gene. We used the HFB380 construct for phosphate
release measurements and the HFG380B2 construct for ATPase,
microtubule-binding and FRET measurements. To create the ΔMTBD mutant, the MTBD
coding region (A3372-K3495) of HF380 was replaced by a linker sequence
(encoding TG). To produce the ΔH2I and ΔSD mutants, the H2I coding region
(S2319-D2329) and the strut coding region (Q3824-E3864) of HFG380B2 were replaced
by a linker sequence (encoding GG and GGGG for ΔH2I and ΔSD, respectively).
The previously used ΔC mutant (V1388-I4324) has been found to have the
C-terminal truncation beyond the boundary between AAA6 and the
C-sequence; we therefore re-designed the ΔC-seq mutant (V1388-E4415) by
removing the 3′-terminal region of the 380-kDa gene encoding S4416-I4730 from
HFG380B2. Each of the constructs was subcloned into the MB38 plasmid. After
verification of the nucleotide sequence of the PCR-amplified regions, each of
the expression constructs was introduced into D. discoideum cells; subsequently,
the transformed cells were selected, cultivated and collected as described. For
the production of the L-selenomethionine-containing motor domain, the
D. discoideum cells were cultivated in HL5 medium (ForMedium) supplemented
with 10 μg ml⁻¹ blasticidin S, 10 μg ml⁻¹ G418 and 12 μg ml⁻¹ tetracycline at
22 °C with shaking at 150 r.p.m. until the cell density reached ~1 × 10⁷ cells
per ml; collected by centrifugation at 1,320g for 5 min; re-suspended in SIH
medium without methionine (ForMedium) supplemented with 100 μg ml⁻¹
L-selenomethionine; and then cultivated further at 22 °C with shaking at
150 r.p.m. for ~24 h. The expressed motor domain was purified as described.
For crystallization, the protein was concentrated to 5.25 mg ml⁻¹ and supplemented
with 3 mM ADP. 7-Diethylamino-3-((((2-maleimidyl)ethyl)amino)carbonyl)-
coumarin-labelled phosphate-binding protein (MDCC-PBP) was prepared as
described. Protein concentrations were determined by the Bradford method
standardized with bovine serum albumin. The microtubule concentration is
expressed as the tubulin dimer concentration.
Crystallization and structure determination. Crystallization of the motor
domain was performed as described. For cryo-protection, the crystals were
transferred into a start solution (5 mM Bis-tris propane (pH 7.0), 7.5 mM Tris
(pH 8.0), 2 mM MgCl2, 8% PEG 8000, 1.5 mM ADP), and the concentration of
PEG 400 was gradually increased from 0% to 30% in 5% increments using the
soaking method. The crystals were then flash-frozen in a cold nitrogen gas stream
at 100 K. For Ta6Br14 and PW12O40 derivatization, the crystals were
cryo-protected and then soaked in the cryo-protection solution containing 0.4 mM
Ta6Br14 and 0.03 mM PW12O40, respectively, for 3 days at 277 K before freezing.
All X-ray diffraction data were collected at 100 K on the beamline BL44XU of
SPring-8, Harima, Japan. Native data were measured at the wavelength of
0.9000 Å, and the Ta6Br14-, PW12O40- and L-selenomethionine-containing heavy
atom derivative data were measured at the wavelength of 1.2547, 1.2142 and
0.9790 Å, respectively. X-ray diffraction images were processed using the
HKL2000 program package. The structure was determined by the multiple
isomorphous replacement method with anomalous scattering (MIRAS) using
the three heavy atom derivative data. The refinement of heavy-atom positions
was performed with SHARP. Phase calculations proceeded successfully with the
autoSHARP pipeline, with the reasonable figure of merit values before density
modification (0.18 for centric and 0.26 for acentric reflections). An initial atomic
model of ΔMTBD was built on the density-modified electron-density map
using COOT. An electron-density map was further improved by using
non-crystallographic symmetry (NCS) averaging. The crystallographic refinement
was performed by CNS at 2.8 Å resolution, and then the manual model
correction and refinement were iterated until convergence. The wild-type structure was
determined using the molecular replacement method with the atomic model of
ΔMTBD as a probe. Several rounds of rigid-body refinement gave a crystallographic
Rwork and Rfree below 40%. At this stage, the MTBD of chain A was clearly seen in
the electron-density map, and a homology model of the MTBD generated by
SWISS-MODEL was located in the density. All atomic coordinates were further
refined by CNS at 3.8 Å resolution with the assistance of deformable elastic
network (DEN) restraints. The models of ΔMTBD and wild type have no
Ramachandran outliers. Intra-molecular interactions were analysed using PISA
and HBPLUS with LigPlot+. The secondary structure elements were assigned
using the program DSSP. We identified and analysed the coiled-coil interactions
with SOCKET, and also checked them with TWISTER. All figures showing the
atomic coordinates were made with PYMOL.
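
The four data-collection wavelengths quoted in this section can be converted to photon energies with E = hc/λ. The short sketch below does only that conversion; the labels are descriptive placeholders, not names from the paper. It is offered as a reading aid: the three derivative energies come out close to the tabulated tantalum L-III, tungsten L-III and selenium K absorption edges, which is the usual reason for choosing such wavelengths in anomalous-scattering experiments, although the text itself does not state this.

```python
HC_KEV_ANGSTROM = 12.398419843  # Planck constant times speed of light, in keV * angstrom


def photon_energy_kev(wavelength_angstrom):
    """Photon energy in keV for an X-ray wavelength given in angstroms."""
    return HC_KEV_ANGSTROM / wavelength_angstrom


# Wavelengths (angstroms) as quoted in the text; labels are illustrative.
wavelengths = {
    "native": 0.9000,
    "tantalum bromide derivative": 1.2547,
    "phosphotungstate derivative": 1.2142,
    "selenomethionine derivative": 0.9790,
}

for label, lam in wavelengths.items():
    print(f"{label}: {lam:.4f} A -> {photon_energy_kev(lam):.3f} keV")
```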

Steady-state biochemical assays. ATPase rates of HFG380B2 and its mutants
(2-30 nM) were measured in a dynein assay buffer (10 mM K-PIPES, 50 mM
potassium acetate, 4 mM MgSO4, 1 mM EGTA, and 1 mM DTT, pH 7.0)
supplemented with 10 μM paclitaxel and the indicated concentration of microtubules
using a coupled enzymatic assay (EnzChek phosphate assay kit, Invitrogen) at
25 °C after mixing with 1 mM ATP, as described. Note that we used the
monomeric motor domain constructs for the functional analyses, and thus the
Km (microtubule) value for microtubule-activated ATPase (~30 μM for
wild-type constructs) is higher than those reported for dimeric constructs.
Microtubule binding was measured by co-sedimentation with microtubules in
the absence or presence of 3.3 mM ATP in the assay buffer supplemented with
5 μM free paclitaxel and 250 μM GTP at 25 °C, as described. To determine FRET
efficiencies, fluorescence intensities of the BFP and GFP moieties of HFG380B2
or its mutant (~120 nM) were measured in the assay buffer at 25 °C after mixing
with 200 μM of the indicated nucleotide, and then FRET efficiencies were
calculated from the sensitized emission of the acceptor (GFP) as described.
Pre-steady-state phosphate release measurements. Pre-steady-state kinetic
measurements were performed at 25 °C using a PBP assay with an SX-18MV
stopped-flow apparatus (Applied Photophysics). For the measurements, the
motor domain (1 μM) in the assay buffer supplemented with 0.1 U ml⁻¹ purine
nucleoside phosphorylase, 500 μM 7-methylguanosine, and 5 μM MDCC-PBP,
was mixed with the same volume of a substrate solution (2 mM ATP and 40 μM
microtubules) in the assay buffer supplemented with 0.1 U ml⁻¹ purine nucleoside
phosphorylase, 500 μM 7-methylguanosine, 5 μM MDCC-PBP, and 10 μM free
paclitaxel. Phosphate release from the motor domain was followed by exciting the
reaction mixture at 425 nm and monitoring the MDCC-PBP fluorescence
enhancement using a 455-nm cutoff filter.
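
Because the two solutions are mixed in equal volumes, every component is diluted by its average across the two syringes. The sketch below works out the resulting concentrations in the observation cell from the values quoted above. The dictionary keys and the helper name are illustrative choices, and the phosphorylase is left out because it is given in activity units rather than molarity.

```python
def after_equal_volume_mixing(solution_a, solution_b):
    """Final concentrations after mixing equal volumes of two solutions.

    Components missing from one solution are treated as zero there.
    """
    species = set(solution_a) | set(solution_b)
    return {
        name: (solution_a.get(name, 0.0) + solution_b.get(name, 0.0)) / 2
        for name in species
    }


# All concentrations in micromolar, taken from the protocol text.
motor_solution = {"motor domain": 1.0, "7-methylguanosine": 500.0, "MDCC-PBP": 5.0}
substrate_solution = {
    "ATP": 2000.0,
    "microtubules": 40.0,
    "7-methylguanosine": 500.0,
    "MDCC-PBP": 5.0,
    "paclitaxel": 10.0,
}

final = after_equal_volume_mixing(motor_solution, substrate_solution)
for name in sorted(final):
    print(f"{name}: {final[name]:g} uM")
```

Components present at the same concentration in both syringes (the reporter protein and 7-methylguanosine) are unchanged by mixing, whereas motor, ATP, microtubules and paclitaxel are halved.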
Very energetic astrophysical events are required to accelerate cosmic
rays to above 10¹⁸ electronvolts. GRBs (γ-ray bursts) have been
proposed as possible candidate sources. In the GRB 'fireball' model,
cosmic-ray acceleration should be accompanied by neutrinos
produced in the decay of charged pions created in interactions between
the high-energy cosmic-ray protons and γ-rays. Previous searches
for such neutrinos found none, but the constraints were weak
because the sensitivity was at best approximately equal to the
predicted flux. Here we report an upper limit on the flux of energetic
neutrinos associated with GRBs that is at least a factor of 3.7 below
the predictions. This implies either that GRBs are not the only
sources of cosmic rays with energies exceeding 10¹⁸ electronvolts or
that the efficiency of neutrino production is much lower than has
been predicted.

Neutrinos from GRBs are produced in the decay of charged pions
produced in interactions between high-energy protons and the intense
γ-ray background within the GRB fireball, for example in the
Δ-resonance process $p + \gamma \rightarrow \Delta^{+} \rightarrow n + \pi^{+}$
(p, proton; γ, photon (here γ-ray); $\Delta^{+}$, delta baryon; n, neutron;
π, pion). When these pions decay via $\pi^{+} \rightarrow \mu^{+}\nu_{\mu}$ and
$\mu^{+} \rightarrow e^{+}\nu_{e}\bar{\nu}_{\mu}$, they produce a flux of
high-energy muon neutrinos ($\nu_{\mu}$) and electron neutrinos ($\nu_{e}$),
coincident with the γ-rays, and peaking at energies of several hundred
tera-electronvolts (TeV) ($\mu^{+}$, antimuon; $e^{+}$, positron). Such a flux
should be detectable using km³-scale instruments like the IceCube
neutrino telescope (Supplementary Fig. 1). Owing to maximal
mixing between muon neutrinos and tau neutrinos, neutrinos from
pion decay in and around GRBs will arrive at Earth in an equal
mixture of flavours. We focus here only on muons produced in $\nu_{\mu}$
charged-current interactions. As the downgoing cosmic-ray muon
background presents challenges for the identification of
neutrino-induced muons, we achieve our highest sensitivity for upgoing
neutrinos (from sources in the northern sky). However, the tight
constraint of spatial and temporal coincidence with a GRB allows some
sensitivity even in the southern sky. One of the two analyses presented
here therefore includes Southern Hemisphere GRBs during the 2009-10
IceCube run.
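
The statement that pion-decay neutrinos arrive in an equal mixture of flavours can be checked with a small calculation. The sketch below is illustrative only and rests on two assumptions that are not taken from the paper: the idealized tribimaximal mixing pattern for the squared mixing-matrix elements, and oscillations fully averaged over astrophysical distances, so that P(a → b) = Σ_i |U_ai|² |U_bi|². Starting from the 1:2:0 ratio of electron, muon and tau neutrinos (neutrinos plus antineutrinos) produced by the decay chain, it returns the flavour ratio at Earth.

```python
from fractions import Fraction as F

# |U_ai|^2 for tribimaximal mixing (an assumed, idealized pattern).
# Rows: e, mu, tau flavours; columns: mass states 1, 2, 3.
U2 = [
    [F(2, 3), F(1, 3), F(0)],
    [F(1, 6), F(1, 3), F(1, 2)],
    [F(1, 6), F(1, 3), F(1, 2)],
]


def flavour_at_earth(source):
    """Propagate source flavour fluxes with oscillation-averaged probabilities
    P(a -> b) = sum_i |U_ai|^2 |U_bi|^2."""
    return [
        sum(
            source[a] * sum(U2[a][i] * U2[b][i] for i in range(3))
            for a in range(3)
        )
        for b in range(3)
    ]


# nu_e : nu_mu : nu_tau at the source from pi+ -> mu+ nu_mu, mu+ -> e+ nu_e anti-nu_mu
source = [F(1), F(2), F(0)]
print([str(x) for x in flavour_at_earth(source)])
```

Exact fractions are used so that the equal-mixture result is not blurred by floating-point rounding; with measured mixing angles the ratio is close to, but not exactly, equal.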

The results presented here were obtained while IceCube was under
construction, using 40 and 59 of the 86 photomultiplier strings of the
final detector (Supplementary Fig. 1), which took data from April 2008
to May 2009 and from May 2009 until May 2010, respectively. During
the 59-string data-taking period, 190 GRBs were observed and
reported by γ-ray observatory satellites via the GRB Coordinates
Network, with 105 in the northern sky. Of those GRBs, 9 were not
included in our catalogue owing to detector downtime associated with
construction and calibration. Two additional GRBs were included
from test runs before the start of the official 59-string run. In addition,
117 northern-sky GRBs were included from the 40-string period to compute
the final combined result. GRB positions were taken from the satellite with
the smallest reported error, which is typically smaller than the IceCube
resolution. The GRB γ-ray emission start ($T_{\mathrm{start}}$) and stop
($T_{\mathrm{stop}}$) times were taken by finding the earliest and latest time
reported for γ-ray emission.


As in our previous study, we conducted two analyses of the IceCube
data. In a model-dependent search, we examine data during the period
of γ-ray emission reported by any satellite for neutrinos with the
energy spectrum predicted from the γ-ray spectra of individual
GRBs. The model-independent analysis searches more generically
for neutrinos on wider timescales, up to the limit of sensitivity to small
numbers of events at ±1 day, or with different spectra. Both analyses
follow the methods used in our previous work, with the exception of
slightly changed event selection and the addition of the Southern
Hemisphere to the model-independent search. Owing to the large
background of downgoing muons from the southern sky, the
Southern Hemisphere analysis is sensitive mainly to higher-energy
events (Supplementary Fig. 3). Systematic uncertainties from detector
effects have been included in the reported limits from both analyses,
and were estimated by varying the simulated detector response and
recomputing the limit, with the dominant factor being the efficiency of
the detector's optical sensors.

In the 59-string portion of the model-dependent analysis, no events
were found to be both on-source and on time (within 10° of a GRB and
between $T_{\mathrm{start}}$ and $T_{\mathrm{stop}}$). From the individual
burst spectra with an assumed ratio of energy in protons to energy in
electrons $\epsilon_p/\epsilon_e = 10$ (ref. 6), 8.4 signal events were
predicted from the combined 2-year data set and a final upper limit (90%
confidence) of 0.27 times the predicted flux can be set (Fig. 1). This
corresponds to a 90% upper limit on $\epsilon_p/\epsilon_e$ of 2.7, with
other parameters held fixed, and includes a 6% systematic
uncertainty from detector effects.
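
The quoted numbers can be cross-checked with a back-of-envelope estimate. The sketch below is not the collaboration's statistical procedure, which includes systematic uncertainties and a full likelihood treatment. It assumes a simple background-free Poisson counting experiment with zero observed events, and it assumes that the predicted neutrino flux scales linearly with the proton-to-electron energy ratio. Under those assumptions the 90% upper limit on the expected event count is -ln(0.1) ≈ 2.3.

```python
import math


def poisson_upper_limit_zero_events(confidence):
    """Classical upper limit on a Poisson mean when zero events are observed
    and background is neglected: solve exp(-mu) = 1 - confidence."""
    return -math.log(1.0 - confidence)


predicted_events = 8.4   # signal events predicted by the model (from the text)
assumed_ratio = 10.0     # proton-to-electron energy ratio assumed in the model

mu_90 = poisson_upper_limit_zero_events(0.90)
flux_fraction = mu_90 / predicted_events        # limit as a fraction of predicted flux
ratio_limit = flux_fraction * assumed_ratio     # assumes flux is linear in the ratio

print(f"90% limit on events: {mu_90:.2f}")
print(f"limit / prediction:  {flux_fraction:.2f}")
print(f"limit on ratio:      {ratio_limit:.1f}")
```

The simple estimate lands on the same 0.27 and 2.7 as the published limit, which suggests the result is driven mainly by the absence of any coincident event rather than by details of the statistical method.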

In the model-independent analysis, two candidate events were
observed at low significance, one 30 s after GRB 091026A (event 1)
and another 14 h before GRB 091230A (most theories predict
neutrinos within a few minutes of the burst). Subsequent examination
showed they had both triggered several tanks in the IceTop surface air
shower array, and are thus very probably muons from cosmic-ray air
showers. Figure 2 shows limits from this analysis on the normalization
of generic power-law muon neutrino spectra expected from
shock acceleration at Earth as a function of the size of the time window
$|\Delta t|$, which is the difference between the neutrino arrival time and the
first reported satellite trigger time. As a cross-check on both results, the
limit from this analysis on the average individual burst spectra
during the time window corresponding to the median duration of
the bursts in the sample (28 s) was 0.24 times the predicted flux, within
10% of the model-dependent analysis.

Assuming that the GRBs in our catalogue are a representative
sample of a total of 667 per year (ref. 7), we can scale the emission
from our catalogue to the emission of all GRBs. The resulting limits can
then be compared to the expected neutrino rates from models that
assume that GRBs are the main sources of ultra-high-energy cosmic
rays (refs 4, 8, 10), with sampling biases of the same order as model
uncertainties in the flux predictions. Limits from the model-independent
analysis on fluxes of this type are shown in Fig. 3.

These limits exclude all tested models (refs 4, 8, 10) with their standard
parameters and uncertainties on those parameters (Figs 1, 3). The 
models are different formulations of the same fireball phenomenology, 


*Lists of participants and their affiliations appear at the end of the paper. 
Figure 1 | Comparison of results to predictions based on observed γ-ray
spectra. The summed flux predictions normalized to γ-ray spectra are
shown as a function of neutrino energy (E) in dashed lines, with the dark grey
dashed line labelled ‘IC40 Guetta et al.’ showing the flux prediction for the
40-string portion of the analysis, and the black dashed line labelled ‘IC40+59
Guetta et al.’ showing the prediction for the full two-year dataset. The
cosmic-ray-normalized Waxman-Bahcall flux is also shown for reference as the pale
grey dashed line. 90% confidence upper limits on these spectra are shown as
solid lines, with the grey line labelled ‘IC40 limit’ showing the previous IceCube
result and the black ‘IC40+IC59 Combined’ line showing the result from the
full dataset (this work). The predicted neutrino flux, when normalized to the
γ-rays, is proportional to the ratio of energy in protons to that in electrons,
which are presumed responsible for the γ-ray emission (ε_p/ε_e, here the standard
10). The flux shown is slightly modified from the original calculation. Φ_ν (left
vertical axis) is the average neutrino flux at Earth, obtained by scaling the
summed predictions from the bursts in our sample (F_ν, right vertical axis) by
the global GRB rate (here 667 bursts yr⁻¹; ref. 7). The first break in the neutrino
spectrum is related to the break in the photon spectrum measured by the
satellites, and the threshold for photo-pion production, whereas the second
break corresponds to the onset of synchrotron losses of muons and pions. Not
all of the parameters used in the neutrino spectrum calculation are measurable
from every burst. In such cases, benchmark values were used for the
unmeasured parameters. Data shown here were taken from the result of the
model-dependent analysis.
Figure 2 | Upper limits on E⁻² power-law muon neutrino fluxes. Limits
were calculated using the Feldman-Cousins method from the results of the
model-independent analysis. The left-hand y-axis shows the total number of
expected ν_μ events, while the right-hand y-axis (F_ν) is the same as in Fig. 1. A
time window of Δt implies observed events arriving between t seconds before the
burst and t afterward. The variation of the upper limit (solid line labelled ‘90%
Upper limit’) with Δt reflects statistical fluctuations in the observed background
rate, as well as the presence of individual events of varying quality. The dashed
line labelled ‘90% Sensitivity’ shows the upper limit that would have been
obtained with exactly the mean expected background. The event at 30 s (event 1)
is consistent with background and believed to be a cosmic-ray air shower.
Figure 3 | Compatibility of some neutrino flux predictions based on cosmic
ray production in GRBs with observations. The cross-hatched area
(‘IC40+59 Allowed 90% CL’) shows the 90% confidence allowed values of the
neutrino flux (vertical axes, as in Fig. 1) versus the neutrino break energy (ε_b) in
comparison to model predictions with estimated uncertainties (points); the
solid line labelled ‘IC40+59 Allowed 95% CL’ shows the upper bound of the
95% confidence allowed region. Data were taken from the model-independent
analysis from the time window corresponding to the median duration of the
GRBs in our catalogue (|Δt| = 28 s). Spectra are represented here as broken
power laws (Φ_ν·{E⁻¹/ε_b, E < ε_b; E⁻², E > ε_b}) with a break energy ε_b
corresponding to the Δ resonance for p–γ interactions in the frame of the shock.
The muon flux in IceCube is dominated by neutrinos with energies around the
first break (ε_b). As such, the upper break, due to synchrotron losses of π⁺, has
been neglected here, as its presence or absence does not contribute significantly
to the muon flux and thus does not have a significant effect on the presented
limits. ε_b is related to the bulk Lorentz factor Γ (ε_b ∝ Γ²); all of the models
shown assume Γ ≈ 300. The value of Γ corresponding to 10⁷ GeV is >1,000 for
all models. Vertical axes are related to the accelerated proton flux by the
model-dependent constant of proportionality f_π. For models assuming a
neutron-decay origin of cosmic rays (ref. 8 and ref. 10) f_π is independent of Γ; for others
(ref. 4) f_π ∝ Γ⁻⁴. Error bars on model predictions are approximate and were
taken either from the original papers, where included, or from the best-available
source in the literature otherwise. The errors are due to uncertainties
in f_π and in fits to the cosmic-ray spectrum. Waxman-Bahcall (ref. 4; circle)
and Rachen (ref. 8; box) fluxes were calculated using a cosmic-ray density of
(1.5–3) × 10⁴⁴ erg Mpc⁻³ yr⁻¹, with 3 × 10⁴⁴ the central value. The Ahlers (ref. 10)
model is shown with a cross. CL, confidence level.


producing neutrinos at proton-photon (p–γ) interactions in internal
shocks. The remaining parameter spaces available to each model 
therefore have similar characteristics: either a low density of high- 
energy protons, below that required to explain the cosmic rays, or a 
low efficiency of neutrino production. 

In the GRB fireball, protons are believed to be accelerated 
stochastically in collisions of internal shocks in the expanding GRB. 
The neutrino flux is proportional to the rate of p–γ interactions, and so
to the proton content of the burst by a model-dependent factor. 
Assuming a model-dependent proton ejection efficiency, the proton 
content can in turn be related to the measured flux of high-energy 
cosmic rays if GRBs are the cosmic-ray sources. Limits on the neutrino 
flux for cosmic-ray-normalized models are shown in Fig. 3; each model 
prediction has been normalized to a value consistent with the observed 
ultra-high-energy cosmic-ray flux. The proton density can also be 
expressed as a fraction of the observed burst energy, directly limiting 
the average proton content of the bursts in our catalogue (Fig. 4). 

An alternative is to reduce the neutrino production efficiency, for
example by modifying the physics included in the predictions or by
increasing the bulk Lorentz boost factor, Γ. Increasing Γ increases the
proton energy threshold for pion production in the observer frame,
thereby reducing the neutrino flux owing to the lower proton density at
higher energies. Astrophysical lower limits on Γ are established by pair
Figure 4 | Constraints on fireball parameters. The shaded region, based on
the result of the model-dependent analysis, shows the values of GRB energy in
protons and the average fireball bulk Lorentz factor for modelled fireballs
allowed by this result at the 90% confidence level. The dotted line indicates the
values of the parameters to which the completed IceCube detector (IC86) is
expected to be sensitive after 3 years of data. The standard values considered
are shown as dashed-dotted and dashed lines and are excluded by this analysis.
Note that the quantities shown here are model-dependent.


production arguments, but the upper limit is less clear. Although it is
possible that Γ may take values of up to 1,000 in some unusual bursts,
the average value is probably lower (usually assumed to be around
300) and the non-thermal γ-ray spectra from the bursts set a weak
constraint that Γ ≲ 2,000 (ref. 18). For all considered models, with
uniform fixed proton content, very high average values of Γ are
required to be compatible with our limits (Figs 3, 4).

In the case of models where cosmic rays escape from the GRB
fireball as neutrons (refs 8, 10), the neutrons and neutrinos are created in the
same p–γ interactions, directly relating the cosmic-ray and neutrino
fluxes and removing many uncertainties in the flux calculation. In
these models, Γ also sets the threshold energy for production of cosmic
rays. The requirement that the extragalactic cosmic rays be produced
in GRBs therefore sets a strong upper limit on Γ: increasing it
beyond ~3,000 causes the proton flux from GRBs to disagree with the
measured cosmic-ray flux above 4 × 10¹⁸ eV, where extragalactic
cosmic rays are believed to be dominant. Limits on Γ in neutron-origin
models from this analysis (≳2,000, Fig. 3) are very close to this point,
and as a result all such models (in which all extragalactic cosmic rays
are emitted from GRBs as neutrons) are now largely ruled out.

Although the precise constraints are model-dependent, the general 
conclusion is the same for all the versions of fireball phenomenology 
we have considered here: either the proton density in GRB fireballs is 
substantially below the level required to explain the highest-energy 
cosmic rays or the physics in GRB shocks is significantly different from 
that included in current models. In either case, our current theories of 
cosmic-ray and neutrino production in GRBs will need to be revisited. 
Coherent quantum phase slip


O. V. Astafiev, L. B. Ioffe, S. Kafanov, Yu. A. Pashkin, K. Yu. Arutyunov, D. Shahar, O. Cohen & J. S. Tsai


A hundred years after the discovery of superconductivity, one
fundamental prediction of the theory, coherent quantum phase
slip (CQPS), has not been observed. CQPS is a phenomenon exactly
dual to the Josephson effect; whereas the latter is a coherent
transfer of charges between superconducting leads, the former
is a coherent transfer of vortices or fluxes across a superconducting
wire. In contrast to previously reported observations of incoherent
phase slip, CQPS has been only a subject of theoretical
study. Its experimental demonstration is made difficult by
quasiparticle dissipation due to gapless excitations in nanowires or
in vortex cores. This difficulty might be overcome by using certain
strongly disordered superconductors near the superconductor-insulator
transition. Here we report direct observation of CQPS in
a narrow segment of a superconducting loop made of strongly
disordered indium oxide; the effect is made manifest through the
superposition of quantum states with different numbers of flux
quanta. As with the Josephson effect, our observation should lead
to new applications in superconducting electronics and quantum
metrology.

Phase slips across superconducting wires lead to non-zero resistance;
they may also lead to qubit dephasing, as a result of charge
noise in a chain of Josephson junctions. Resistance measurements
are dissipative, so saturation of the resistance at low temperatures cannot
be interpreted as evidence for coherent QPS; however, a blockade of
supercurrent might indicate CQPS, as demonstrated in ref. 8. A system
decoupled from the environment has been suggested: a superconducting
loop in which phase slips change the number of quantized
fluxes, resulting in their quantum superposition and therefore exhibiting
CQPS. We report a successful implementation of this idea using
superconducting loops made of highly disordered InO_x.

We begin by considering a superconducting loop of effective area
S with a high kinetic inductance L_k, containing a narrow segment
(nanowire) with a finite rate of QPS. The external magnetic field B_ext
perpendicular to the loop induces a flux Φ_ext = B_ext S. The states of the
loop are described by the phase winding number, N; their energies are
E_N = (Φ_ext − NΦ_0)²/2L_k (Fig. 1a). The energy difference between
adjacent states |N+1⟩ and |N⟩ is E_{N+1} − E_N = 2I_pδΦ, where
δΦ = Φ_ext − (N + 1/2)Φ_0 and I_p = Φ_0/2L_k is the loop persistent current.
At δΦ = 0, the two states are degenerate. The QPS process, characterized
by the amplitude E_S, couples the flux states, resulting in the Hamiltonian


$$H = -\frac{1}{2}E_S\left(|N+1\rangle\langle N| + |N\rangle\langle N+1|\right) + E_N|N\rangle\langle N| \qquad (1)$$


which is dual to the Hamiltonian of a superconducting island connected
to a reservoir through a Josephson junction. In the latter, L_k is
replaced by capacitance, E_S by the Josephson energy, and N is the
number of the Cooper pairs on the island. The energy splitting
between the ground and excited states
|g⟩ = sin(θ/2)|N⟩ + cos(θ/2)|N+1⟩ and |e⟩ = cos(θ/2)|N⟩ − sin(θ/2)|N+1⟩
is ΔE = √((2I_pδΦ)² + E_S²), where
the mixing angle is θ = arctan[E_S/(2I_pδΦ)].
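
The two-level structure above can be checked numerically. The following is a minimal sketch, not the authors' code: it builds the 2×2 Hamiltonian in the basis (|N⟩, |N+1⟩) directly from E_N = (Φ_ext − NΦ_0)²/2L_k, so no sign convention for δΦ has to be assumed, and verifies that the quoted ground state is an eigenvector with energy (E_N + E_{N+1})/2 − ΔE/2. The function names, the use of `atan2` for the arctan branch, and the default values (L_k = 42 nH and E_S/h = 4.9 GHz, the fitted values reported later in the paper) are illustrative assumptions.

```python
import math

PHI0 = 2.067833848e-15      # magnetic flux quantum h/2e, in Wb
H_PLANCK = 6.62607015e-34   # Planck constant, in J s


def flux_state_energy_ghz(n, phi_ext, l_k):
    """E_N/h in GHz; phi_ext is the external flux in units of PHI0."""
    return ((phi_ext - n) * PHI0) ** 2 / (2.0 * l_k) / H_PLANCK / 1e9


def two_level(phi_ext, n=0, l_k=42e-9, es_ghz=4.9):
    """Return (splitting in GHz, mixing angle, eigenvector residual)."""
    e_n = flux_state_energy_ghz(n, phi_ext, l_k)
    e_n1 = flux_state_energy_ghz(n + 1, phi_ext, l_k)
    bias = e_n - e_n1                    # flux-dependent detuning, GHz
    delta_e = math.hypot(bias, es_ghz)   # sqrt(bias^2 + E_S^2)
    theta = math.atan2(es_ghz, bias)     # assumption: atan2 branch of arctan
    # Ground state in the basis (|N>, |N+1>), as written in the text.
    g = (math.sin(theta / 2.0), math.cos(theta / 2.0))
    # Apply H = [[E_N, -E_S/2], [-E_S/2, E_{N+1}]] to g.
    off = -es_ghz / 2.0
    hg = (e_n * g[0] + off * g[1], off * g[0] + e_n1 * g[1])
    e_ground = 0.5 * (e_n + e_n1) - 0.5 * delta_e
    residual = max(abs(hg[0] - e_ground * g[0]),
                   abs(hg[1] - e_ground * g[1]))
    return delta_e, theta, residual
```

At the degeneracy point (`phi_ext = 0.5`) the detuning vanishes, the mixing angle is π/2 and the splitting reduces to E_S/h; away from it the residual stays at the level of floating-point rounding.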


To detect CQPS, the loop is coupled to the coplanar line (resonator)
by mutual inductance, M (ref. 19). In a rotating wave approximation,
the effective Hamiltonian of the system resonantly driven by a classical
microwave field with current amplitude I_mw cos(ΔEt/ħ) is


$$H_{\mathrm{RWA}} = \frac{\hbar\Omega}{2}\left(|e\rangle\langle g| + |g\rangle\langle e|\right), \qquad \hbar\Omega = M I_p I_{mw}\frac{E_S}{\Delta E}$$


and ħ is the reduced Planck’s constant, h/2π. Note that a transition between the
two states can be induced only when E_S ≠ 0, so observation of the
spectroscopy peak constitutes direct evidence for CQPS.

Now we provide theoretical background for our choice of material.
CQPS results from quantum fluctuations of the order parameter.
Generally, the impact of fluctuations is characterized by the
Ginzburg parameter, G_i, which is essentially the inverse number of
Cooper pairs in a volume ξ³: G_i = (ξ³νΔ)⁻¹, where ν is the electron
density of states, Δ is the superconducting gap and ξ is the coherence
length. Even in disordered bulk superconductors characterized by
k_F l ~ 1 (where k_F is the Fermi wavevector and l is the mean free path),
G_i is small: G_i ~ (k_F ξ)⁻¹ ≪ 1. Strong fluctuations require materials
with an even higher degree of disorder, in which electrons are localized.
The superconductivity is suppressed if the localization length, ξ_loc,
becomes shorter than ξ. Following this reasoning, we expect that the
fluctuations are maximal (G_i ~ 1) when ξ_loc ~ ξ, that is, close to the
superconductor-insulator transition. Although BCS theory fails to
describe this transition, it provides a qualitatively correct explanation
for the behaviour of materials with ξ_loc ~ ξ. The fluctuations are




Figure 1 | The device. a, Energies of the loop versus external flux, Φ_ext. Blue
and red lines: ground and excited energy levels, respectively. The degeneracy
between states with N and N + 1 flux quanta (Φ_0), seen at Φ_ext = (N + 1/2)Φ_0,
is lifted by the phase-slip energy, E_S. b, InO_x loop with a narrow wire segment
on the right side is attached to the resonator (horizontal line) at the bottom.
c, False-colour scanning-electron micrograph of the narrow InO_x segment.
d, Step-impedance resonator, comprising a 3-µm-wide InO_x strip with wave
impedance Z_1 ≈ 1,600 Ω galvanically coupled to a gold coplanar line with
impedance Z_0 = 50 Ω. The boundaries of the resonator are defined by the
strong impedance mismatch (Z_1 ≫ Z_0).
enhanced in narrow wires with a small number of effective conductive
channels, N_ch = R_q/R_ξ, where R_ξ is the resistance of a wire of length
ξ and R_q = h/e². The phase-slip amplitude decreases with N_ch as
E_S ∝ exp(−aN_ch) (refs 20, 21), where a is a numerical parameter of
order 1. Our data are consistent with a ≈ 0.3 for R_ξ = 1 kΩ and
ξ = 10 nm (Supplementary Information). Thus, E_S is expected to be
sizeable in strongly disordered, quasi-one-dimensional superconducting
wires. But a high degree of disorder may also enhance Coulomb
repulsion between electrons, and turn the superconductor into a
dissipative normal metal. The ideal materials for observation of CQPS
should therefore form localized, pre-formed pairs even before they
become superconducting. Such behaviour has been observed in
amorphous InO_x and TiN thin films.

Our loops (Fig. 1b) are fabricated from a 35-nm-thick superconducting
InO_x film with ξ = 10–30 nm (ref. 28), using electron-beam
lithography. The uncertainty in the magnitude of ξ is due to
the poor applicability of BCS theory (see Supplementary Information).
Each loop contains a narrow wire segment, about 40 nm wide and
400 nm long (Fig. 1c). This wire is reasonably homogeneous, varying
in width by less than 10 nm. We have characterized identical wires and
films, fabricated by the same process, using d.c. transport measurements.
The film exhibits a superconducting transition at T_c = 2.7 K.
The sheet resistance R_□ of the wide films, measured slightly above T_c,
is 1.7 kΩ, which gives a rough (BCS) estimate of the sheet inductance as
L_□ = ħR_□/πΔ ≈ 0.7 nH (ref. 13) (for Δ ≈ 0.5 meV; ref. 26) for the wide
sections of the loops.

To measure the loops, we incorporate them in a step-impedance
coplanar resonator: a strip line of InO_x of length L = 1.5 mm and width
W = 3 µm, galvanically connected to two gold coplanar lines (with
impedance Z_0 = 50 Ω) at the ends (Fig. 1d). We estimate the effective
wave impedance of the line to be Z_1 = √(l_1/c_1) ≈ 1.6 kΩ (with specific
inductance l_1 = L_□/W ≈ 2.3 × 10⁻⁴ H m⁻¹ and specific capacitance
c_1 ≈ 0.85 × 10⁻¹⁰ F m⁻¹). Because Z_1 ≫ Z_0, the strong mismatch
results in standing wave formation with maximal current at the
boundaries: I(x) = I_m cos(πmx/L), where x is the coordinate along
the resonator (0 ≤ x ≤ L). The resonance frequency of the mth mode
f_m (m = 1, 2, 3...) is given by f_m = mv/2L = m × 2.4 GHz, where
v = (l_1c_1)^(−1/2) ≈ 7.2 × 10⁶ m s⁻¹ is the group velocity in the resonator.
The energy decay rate in such a resonator is κ ≈ (4Z_0/Z_1)(v/L), which
limits the power peak width to κ/2π ≈ 0.1 GHz.
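
The chain of estimates in this paragraph can be reproduced from the quoted inputs. The sketch below is illustrative only: the variable names are hypothetical, the physical constants are CODATA values, and the inputs (R_□ = 1.7 kΩ, Δ = 0.5 meV, W = 3 µm, L = 1.5 mm, c_1 = 0.85 × 10⁻¹⁰ F m⁻¹, Z_0 = 50 Ω) are taken from the text.

```python
import math

HBAR = 1.054571817e-34        # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19    # elementary charge, C

r_sheet = 1.7e3               # sheet resistance above T_c, ohm
gap = 0.5e-3 * E_CHARGE       # superconducting gap, 0.5 meV expressed in J
width = 3e-6                  # strip width W, m
length = 1.5e-3               # resonator length L, m
c1 = 0.85e-10                 # specific capacitance, F/m (quoted value)
z0 = 50.0                     # gold coplanar line impedance, ohm

l_sheet = HBAR * r_sheet / (math.pi * gap)   # BCS sheet inductance, H
l1 = l_sheet / width                         # specific inductance, H/m
z1 = math.sqrt(l1 / c1)                      # wave impedance, ohm
v = 1.0 / math.sqrt(l1 * c1)                 # group velocity, m/s
f1 = v / (2.0 * length)                      # fundamental mode, Hz
kappa = (4.0 * z0 / z1) * (v / length)       # energy decay rate, 1/s

print(f"L_sheet = {l_sheet * 1e9:.2f} nH, Z1 = {z1:.0f} ohm")
print(f"v = {v:.2e} m/s, f1 = {f1 / 1e9:.2f} GHz")
print(f"kappa/2pi = {kappa / (2 * math.pi) / 1e9:.2f} GHz")
```

The results agree with the rounded values in the text to within a few per cent, which is the accuracy one would expect given that the inputs themselves are rough estimates.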

Our main results are as follows. Figure 2a shows the power transmission
coefficient |t|² through the resonator at a temperature of
40 mK. The peaks correspond to m = 3, 4, 5, with resonance frequencies
f_3 = 6.65 GHz, f_4 = 9.08 GHz and f_5 = 11.00 GHz (close to our estimates
above). The actual peak widths (full-width at half-maximum)
are approximately 250 MHz (= κ/2π), which is twice as large as that
expected from the loading loss. This is probably due to an extra loss in
the gold ground-plane films. Our loops located in the middle of the
resonator (x = L/2) are coupled only to the even modes m, for which the
current defined by I_mw = I_m cos(πm/2) is non-zero.

To detect the superposition of flux states we measure transmission t
through the resonator at f_m versus B_ext. The transmission does not
depend on B_ext at the third and fifth peaks. However, at the fourth
mode peak, t exhibits well pronounced periodic structure: sharp negative
dips in the amplitude |t|, and in the phase rotation arg(t), as shown
in Fig. 2b. The period, ΔB = 0.061 mT, corresponds with high accuracy
to one flux quantum, Φ_0, through the area S = 32 µm² of the loop
shown in Fig. 1b, with a 40-nm-wide wire. Note that there is a small
(~5%) uncertainty in the loop area, owing to the finite width of the
lines. Each resonator contains five loops, with nanowire widths 40, 60,
80, 100 and 120 nm. The loops differ slightly in area, varying by
10–12% from one to another. No CQPS-related signals are found from the
samples with nanowire segments wider than 40 nm, indicating that in
these loops E_S/h < 1 GHz, in good agreement with our prediction that
E_S will be suppressed by more than a factor of 10 with an increase of
20 nm in the nanowire width. (See Supplementary Information.)
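
The link between the field period and the loop area is the flux-quantization condition ΔB · S = Φ_0. As a quick consistency check of the quoted numbers (a sketch with hypothetical variable names, using the CODATA value of the flux quantum):

```python
PHI0 = 2.067833848e-15   # magnetic flux quantum h/2e, Wb

delta_b = 0.061e-3       # measured field period, T
area_nominal = 32e-12    # quoted loop area, m^2

# Area implied by one flux quantum per field period.
area_from_period = PHI0 / delta_b
mismatch = abs(area_from_period - area_nominal) / area_nominal

print(f"implied area = {area_from_period * 1e12:.1f} um^2, "
      f"mismatch = {mismatch:.1%}")
```

The implied area differs from the quoted one by several per cent, comparable to the stated uncertainty in the loop area.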
Figure 2 | Experimental data. a, Power transmission through the resonator,
measured within the bandwidth of our experimental setup. Peaks in
transmission power coefficient, |t|², correspond to resonator modes, with
mode number m indicated for each peak. a.u., arbitrary units. b, Transmission
through the resonator as function of external magnetic field B_ext at m = 4
(f_4 = 9.08 GHz). The periodic structure in amplitude (|t|) and phase (arg(t))
corresponds to the points where the lowest-level energy gap ΔE/h matches f_4.
The period ΔB = 0.061 mT (= Φ_0/S) indicates that the response comes from
the loop (shown in Fig. 1b), with the effective loop area S = 32 µm². c, The
two-level spectroscopy line obtained in two-tone measurements. The phase of
transmission, arg(t), through the resonator at f_4 is monitored, while another
tone with frequency f_probe from an additional microwave generator, and B_ext,
are independently swept. The plot is filtered to eliminate the contribution of
other resonances (2 ≤ m ≤ 6), visible as horizontal red features. The dashed
line is the fit to the energy splitting, with E_S/h = 4.9 GHz, I_p = 24 nA. d, The
resonant dip is measured at Φ/Φ_0 = 0.52. The red curve is the Gaussian fit.


Although the coupling of our loop to the resonator (g/h ≈ 10 MHz)
is weak (g ≪ ħκ), we were able to perform spectroscopy measurements
by monitoring resonator transmission while scanning B_ext
and the frequency, f_probe, of an additional probe microwave tone. The
transmission phase plot shows the resonance excitation of the two-level
system (Fig. 2c). The obtained structure is distorted by periodic
resonances at f_m (seen as red horizontal features), and the resulting
picture is plotted after filtering out these resonances. The green–blue
line corresponds to the expected energy splitting, which is well fit by
ΔE = √((2I_pδΦ)² + E_S²) (dashed line), with the fitting parameters
E_S/h = 4.9 GHz and I_p = 24 nA, corresponding to L_k = 42 nH.
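
For orientation, the fitted curve can be evaluated directly. The sketch below is an illustration under stated assumptions, not the authors' fitting code: the function name and constants are hypothetical, the flux detuning is expressed in units of Φ_0, and the kinetic inductance is obtained from the relation I_p = Φ_0/2L_k given earlier.

```python
import math

PHI0 = 2.067833848e-15      # magnetic flux quantum h/2e, Wb
H_PLANCK = 6.62607015e-34   # Planck constant, J s

I_P = 24e-9                 # fitted persistent current, A
ES_GHZ = 4.9                # fitted phase-slip amplitude E_S/h, GHz


def splitting_ghz(dphi):
    """Energy splitting Delta E/h in GHz for flux detuning dphi (units of PHI0)."""
    bias_ghz = 2.0 * I_P * dphi * PHI0 / H_PLANCK / 1e9
    return math.hypot(bias_ghz, ES_GHZ)


# Kinetic inductance implied by I_p = PHI0 / (2 L_k).
l_k = PHI0 / (2.0 * I_P)

for dphi in (0.0, 0.02, 0.1, 0.5):
    print(f"dphi = {dphi:4.2f} PHI0 -> splitting = {splitting_ghz(dphi):6.2f} GHz")
print(f"L_k = {l_k * 1e9:.1f} nH")
```

With the rounded value I_p = 24 nA the implied inductance comes out close to, but not exactly at, the quoted 42 nH; the small difference is within the rounding of the fitted parameters.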

The exact splitting at the degeneracy point δΦ = 0 happens to
nearly coincide with f_2 ≈ 4.8 GHz, distorting the line shape.
Figure 2d shows the resonant peak at Φ_ext = 0.52Φ_0, slightly away
from the degeneracy point, where the resonant frequency
ΔE/h ≈ 5.6 GHz is between f_2 and f_3. The spectroscopy peak is well
fit by a Gaussian, A exp[−(1/2)((f − ΔE/h)/Δf)²], with Δf = 260 MHz.
This demonstrates coherent coupling between the flux states in the
loop. An interesting question, which requires further study, is the
mechanism of decoherence in this type of system. At this stage, we
can only conjecture that the shape suggests peak broadening due to
low-frequency Gaussian noise, rather than relaxation.

We now discuss the relation between the measured and expected
properties of the system. The total loop resistance R (at T > T_c) is the
sum of the resistances of the wide section (estimated from R_□) and of
the nanowire, R_w ≈ 30 kΩ, deduced from the d.c. measurements of
identical samples. We estimate R ≈ 55 kΩ. As one might expect, the
effective sheet resistance of the narrow wires is higher than R_□ (in our
case, by a factor of 1.8). The BCS estimate of the inductance of the whole
loop gives L_k ≈ 23 nH. Close to the superconductor-insulator transition
(SIT), R rises continuously with decreasing temperature before the
onset of superconductivity, whereas L_k diverges at the transition.
Arguing by continuity, BCS theory should underestimate L_k in the
strongly disordered superconductors (see Supplementary Information).
We underestimate L_k in our films by a factor of 1.8, which is
very reasonable for the superconductors close to the SIT.
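
The factor of 1.8 follows from comparing the BCS estimate with the inductance obtained from spectroscopy. A minimal sketch, assuming that the sheet-inductance formula L = ħR/πΔ quoted earlier can be applied to the total normal-state loop resistance (the function name is hypothetical):

```python
import math

HBAR = 1.054571817e-34        # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19    # elementary charge, C


def bcs_kinetic_inductance(r_normal, gap_ev):
    """BCS estimate L = hbar * R / (pi * gap); R in ohm, gap in eV, result in H."""
    return HBAR * r_normal / (math.pi * gap_ev * E_CHARGE)


l_bcs = bcs_kinetic_inductance(55e3, 0.5e-3)   # whole loop, R = 55 kOhm
l_measured = 42e-9                             # from the spectroscopy fit, H
ratio = l_measured / l_bcs

print(f"L_BCS = {l_bcs * 1e9:.1f} nH, measured/BCS = {ratio:.2f}")
```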

Another measure of the wire quality is the value of its critical
current. For a BCS superconductor, the maximal supercurrent is
I_c ≈ 0.75Δ/eR_ξ (Supplementary Information). Taking into account
that in the vicinity of the SIT, I_c is overestimated by at least the same
factor of 1.8 (as with L_k), we find I_c ≈ 280–90 nA, for ξ = 10–30 nm.
The measured value, I_c = 100 nA, is in full agreement with these
expectations.
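
The quoted range can be reproduced with a short calculation. This sketch rests on an assumption that is not stated in the text: that R_ξ is obtained by scaling the nanowire resistance (about 30 kΩ over 400 nm) linearly with length. The function and parameter names are hypothetical.

```python
def critical_current(xi, r_wire=30e3, wire_length=400e-9,
                     gap_volts=0.5e-3, sit_factor=1.8):
    """Estimate I_c = 0.75 * (gap/e) / R_xi, reduced by the SIT correction factor.

    xi and wire_length in metres, r_wire in ohm, gap_volts = gap/e in volts.
    """
    # Assumption: uniform wire, so resistance scales linearly with length.
    r_xi = r_wire * xi / wire_length
    return 0.75 * gap_volts / r_xi / sit_factor


for xi_nm in (10, 30):
    i_c = critical_current(xi_nm * 1e-9)
    print(f"xi = {xi_nm} nm -> I_c = {i_c * 1e9:.0f} nA")
```

Under this assumption the two ends of the coherence-length range give currents close to the quoted 280 nA and 90 nA.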

The quantitative estimate of the QPS amplitude is less accurate than
for kinetic inductance or critical current, owing to its exponential
dependence on sample and material parameters. Moreover, in the
BCS approximation, QPS amplitude explicitly depends on ξ, which
is not well defined close to the SIT. Rough estimates based on ref. 15
give values of ~5–20 GHz for ξ = 10 nm; our calculations, based on
the measured value of the sheet kinetic inductance, yield E_S ≈ 10 GHz,
in agreement with the data (see Supplementary Information).

We have obtained additional evidence for CQPS, as follows. In
addition to the data presented here for a single sample, we observed
very similar results in two other samples fabricated from the same InO_x
film. One of those samples was measured in the step-impedance
resonator described above, and the other in an open-line configuration.
In all three samples, we found resonances with close values of persistent
current, and level splitting E_S/h equal to 4.9, 5.8 and 9.5 GHz,
respectively. The variation in E_S is surprisingly low (given its exponential
dependence on parameters), and is in agreement with expectations (see
Figure 3 | Spectroscopy of the system across a wide range of flux and
frequency. The spectroscopy response is obtained by measuring the
transmission amplitude and phase through the resonator at f_4 = 9.08 GHz,
while the frequency of the additional microwave tone f_probe is swept. Depending
on the range, the two-level system excitation frequency ΔE/h is derived from
the direct (single-photon) excitation ΔE/h = f_probe (blue circles), the
two-photon process ΔE/h = f_probe + f_4 (green circles), or the three-photon process
ΔE/h = 2f_probe + f_4 (red circles). With these methods, using f_probe ≤ 35 GHz, we
could trace ΔE/h up to about 77 GHz. The dashed black line is the calculated
energy splitting, ΔE/h = √((2I_pδΦ)² + E_S²)/h, with E_S/h = 4.9 GHz and I_p = 24 nA.
Perfect agreement of the experimental data with the calculated energy bands
supports our interpretation; the quantum states of the system correspond to the
superposition of the two adjacent flux states induced by CQPS. The rare
scattered points result from noise, resonator resonances and higher-order
excitation processes. For comparison, the additional dotted cyan lines show the
expected behaviour of a radio-frequency-SQUID qubit with E_J/h = 50 GHz
(I_c = 100 nA), capacitance C = 1.1 fF and inductance L = 38 nH, which
qualitatively disagrees with our data at the degeneracy points Φ_ext = 0 and
Φ_ext = Φ_0.
Supplementary Information). This reproducibility is a strong argument
against an alternative interpretation of the results based on unintentional
formation of a rogue Josephson junction somewhere in the
wire. Second, as in a charge qubit (which is dual to our system with
CQPS), the energy difference between the two lowest states in the loop
asymptotically approaches linear dependence, ΔE = 2I_pδΦ, at
ΔE ≫ E_S. In our experiments, we are able to trace the two-level system
resonance up to |δΦ| = 0.5Φ_0, and ΔE/h ≈ 77 GHz (Fig. 3). This linear
dependence, without any observable splitting at Φ = 0 and Φ_0
(corresponding to the second-order 4π-process), indicates a linear inductance
in our system, rather than a nonlinear one originating in a rogue
Josephson junction. To support this statement, we present simulations
of the spectroscopy lines (dotted cyan lines in Fig. 3) for a
radio-frequency-SQUID qubit with a single Josephson junction (with
I_c = 100 nA), and with capacitance and linear inductance adjusted to
provide the best fit of our spectroscopy data. The simulations show a
significant rounding at Φ = 0 and Φ_0, owing to the nonlinear
Josephson inductance, in qualitative disagreement with our results.
These arguments allow us to exclude a rogue Josephson junction
formed in our device; however, we cannot distinguish a number of
interfering phase slip locations in the wire, if, for example, the wire is
not uniform.
Air density 2.7 billion years ago limited to less than
twice modern levels by fossil raindrop imprints 


Sanjoy M. Som, David C. Catling, Jelte P. Harnmeijer, Peter M. Polivka & Roger Buick


According to the ‘Faint Young Sun’ paradox, during the late
Archaean eon a Sun approximately 20% dimmer warmed the early
Earth such that it had liquid water and a clement climate.
Explanations for this phenomenon have invoked a denser atmosphere
that provided warmth by nitrogen pressure broadening or
enhanced greenhouse gas concentrations. Such solutions are
allowed by geochemical studies and numerical investigations that
place approximate concentration limits on Archaean atmospheric
gases, including methane, carbon dioxide and oxygen. But no field
data constraining ground-level air density and barometric pressure
have been reported, leaving the plausibility of these various hypotheses
in doubt. Here we show that raindrop imprints in tuffs of the
Ventersdorp Supergroup, South Africa, constrain surface air density
2.7 billion years ago to less than twice modern levels. We interpret
the raindrop fossils using experiments in which water droplets of
known size fall at terminal velocity into fresh and weathered volcanic
ash, thus defining a relationship between imprint size and raindrop
impact momentum. Fragmentation following raindrop flattening
limits raindrop size to a maximum value independent of air density,
whereas raindrop terminal velocity varies as the inverse of the square
root of air density. If the Archaean raindrops reached the modern
maximum measured size, air density must have been less than
2.3 kg m⁻³, compared to today’s 1.2 kg m⁻³, but because such drops
rarely occur, air density was more probably below 1.3 kg m⁻³. The
upper estimate for air density renders the pressure broadening
explanation possible, but it is improbable under the likely lower
estimates. Our results also disallow the extreme CO₂ levels required
for hot Archaean climates.
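
The inverse-square-root scaling stated above is easy to make concrete. The following sketch (function name hypothetical) gives the terminal velocity of a drop of fixed size relative to its modern value; for a fixed drop mass, the impact momentum scales by the same factor.

```python
import math

RHO_MODERN = 1.2   # modern surface air density, kg/m^3 (value quoted in the text)


def velocity_ratio(rho_air, rho_ref=RHO_MODERN):
    """Terminal velocity relative to its value at rho_ref, for a drop of fixed size."""
    return math.sqrt(rho_ref / rho_air)


for rho in (1.2, 1.3, 2.3):
    print(f"rho = {rho:.1f} kg/m^3 -> v/v_modern = {velocity_ratio(rho):.3f}")
```

At the upper density limit of 2.3 kg m⁻³ a drop of a given size would fall at roughly 70% of its modern terminal velocity.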

Numerical investigations of Archaean atmospheric composition” ® 
typically assume a modern, total atmospheric pressure of about one 
atmosphere (1 atm), but there are good reasons why barometric pressure 
may have been different. First, the partial pressure of oxygen po, was 
negligible before the Great Oxidation Event at around 2.35 billion years 
ago’. There are several independent lines of evidence for this’®, the 
strongest being widespread and large mass-independent fractionations 
of sulphur isotopes in Archaean sediments that arise only in an atmo- 
sphere with less than about one part oxygen per million by volume 
(p.p.m.v.)"*. Second, before the advent of an aerobic nitrogen cycle 
coincident with rising oxygen levels’’, the flux of nitrogen back to 
the atmosphere via the now-dominant nitrification—-denitrification 
pathway would have been different from now. So a lack of oxygen 
before the Great Oxidation Event should have affected the partial 
pressure of nitrogen pN₂, the major gas contributing to total atmospheric
pressure. Moreover, it has been calculated that a pN₂ of
2.37 atm at 2.5 billion years ago could solve the ‘Faint Young Sun’ 
paradox by pressure-broadening infrared absorption of greenhouse 
gases’. Other studies postulate a hot (~70 °C) Archaean ocean based 
on oxygen isotopes in cherts’’, requiring a partial pressure of carbon 
dioxide pCO₂ of about 2-6 bar (ref. 8), which would contradict the pCO₂
levels of only 10-50 present atmospheric levels (PAL) constrained 


from 2.69-billion-year-old palaeosols’. Such ambiguities concerning 
the composition of the ancient atmosphere could be resolved, or 
improved upon, by knowledge of total atmospheric pressure. Here, 
we use raindrop imprints to constrain total ground-level atmospheric 
density (and thus total surface pressure) 2.7 billion years before pre- 
sent. The idea of using raindrop imprints as a proxy for air density was 
suggested by Lyell’* in 1851 but has hitherto been unexplored. 

On the ancient Earth, maximum raindrop diameters should have 
been essentially identical to today’s, because the maximum size beyond 
which raindrops disintegrate at terminal velocity is independent of air 
density. Falling raindrops flatten'*'® and fragment when the total 
aerodynamic forces exceed the combination of surface tension and 
hydrostatic forces'’. Fragmentation begins when the raindrop bottom 
becomes flat at a force balance given by”: 

$$V_{\mathrm{term}}^{2}\, d = \frac{8\gamma}{n\,\rho_{\mathrm{air}}\,C_{\mathrm{d}}} \qquad (1)$$

where V_term is terminal velocity, d is the diameter of a sphere equivalent
to the drop volume, γ is surface tension, ρ_air is air density, C_d is the
drag coefficient, and n is a factor relating the radius of the upper
curvature of the drop to its spherical equivalent radius. Theory relates
terminal velocity to raindrop size and predicts 9.3 m s⁻¹ for a raindrop
of 6.8 mm in diameter, the largest measured raindrop at ground
level. Typical values under standard surface atmospheric conditions
(γ = 7 × 10⁻² N m⁻¹ and ρ_air = 1.2 kg m⁻³) yield a constant value of
0.80 for nC_d in equation (1), comparing favourably with nC_d = 0.85
from independent studies and consistent with observations that the
product V_term² d is constant.
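
The quoted value of the product nC_d can be checked by simple arithmetic. The sketch below assumes equation (1) has the form V_term² d = 8γ/(n ρ_air C_d) and solves it for nC_d with the numbers given in the text; the function name is illustrative only.

```python
def n_cd_product(v_term, d, gamma, rho_air):
    """Solve V_term^2 * d = 8*gamma / (n * rho_air * C_d) for the product n*C_d."""
    return 8.0 * gamma / (rho_air * v_term**2 * d)

# Values quoted in the text for the largest measured raindrop
# under standard surface conditions (SI units).
value = n_cd_product(v_term=9.3, d=6.8e-3, gamma=7e-2, rho_air=1.2)
print(f"n*C_d = {value:.2f}")
```

Under these inputs the product comes out close to the rounded 0.80 quoted above.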

A further relationship derived from empirical correlations exists 
between air density and maximal terminal velocity'*”**: 


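$$V_{\mathrm{term,max}} = 2\left(\frac{\rho_{\mathrm{water}}\, g\, \gamma}{\rho_{\mathrm{air}}^{2}}\right)^{0.25} = \left(\frac{\text{a known constant}}{\rho_{\mathrm{air}}}\right)^{0.5} \qquad (2)$$
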
where g is gravitational acceleration and ρ_water is the density of water.
Equation (2) also gives a maximum terminal velocity of 9.3 m s⁻¹, which
corresponds to the largest raindrops of 6.8 mm in diameter. Substitution
of V_term ∝ ρ_air^(−1/2) from equation (2) into d ∝ (V_term² ρ_air)^(−1) from
equation (1) cancels out ρ_air, showing that maximum raindrop size is
independent of air density. Drop equivalent diameter d is thus simply a
function of surface tension γ. The slight decrease of surface tension with
temperature causes only a trivial terminal velocity change of 0.05 m s⁻¹
over 15-30 °C, meaning that somewhat different Archaean temperatures 
would not affect our conclusions. Consequently, an upper bound on air 
density can be derived from the largest raindrop imprints, formed by the 
transfer of momentum from the largest impacting raindrops to the 
substrate. 
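
The cancellation of air density can be illustrated numerically. This is a minimal sketch, assuming equation (1) reads V_term² d = 8γ/(n ρ_air C_d), equation (2) reads V_term,max = 2(ρ_water g γ/ρ_air²)^0.25, and taking nC_d = 0.80 from the text; the water density and the trial air densities are assumed round values.

```python
G = 9.81            # gravitational acceleration, m s^-2
GAMMA = 7e-2        # surface tension of water, N m^-1 (value quoted in the text)
RHO_WATER = 1000.0  # density of water, kg m^-3 (assumed round value)
N_CD = 0.80         # product n*C_d quoted in the text

def v_term_max(rho_air):
    """Equation (2): maximum terminal velocity (m/s) for a given air density."""
    return 2.0 * (RHO_WATER * G * GAMMA / rho_air**2) ** 0.25

def d_max(rho_air):
    """Equation (1) solved for d (m), evaluated at the maximum terminal velocity."""
    return 8.0 * GAMMA / (N_CD * rho_air * v_term_max(rho_air) ** 2)

# Trial air densities in kg m^-3: the velocity changes, the drop size does not.
for rho in (0.6, 1.2, 2.3):
    print(f"rho_air={rho:.1f}  V_max={v_term_max(rho):.2f} m/s  "
          f"d_max={1e3 * d_max(rho):.2f} mm")
```

The velocity scales as the inverse square root of air density, whereas the computed maximum diameter is the same for every trial density.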

The Archaean imprints studied here (Fig. 1 and Supplementary 
Information) are from the Omdraaivlei farm near Prieska, South 
Africa, in the Kameeldoorns Formation of the Platberg Group, the 
middle unit of the 2.7-billion-year-old Ventersdorp Supergroup” 
Figure 1 | The 2.7-billion-year-old Ventersdorp Supergroup raindrop
imprints lithified in tuff at Omdraaivlei, South Africa. a, Detail of slightly 
elliptical outlines of raindrop imprints. b, Cross-section photograph of 
imprints penetrating 1-2 mm into coarse accretionary lapilli tuff, and draped 


(Supplementary Fig. 2). They penetrate into very poorly sorted fine tuff 
of intermediate igneous composition. A layer of pale, very fine volcanic 
ash 0.5-0.8 mm thick drapes the imprints (Fig. 1b and Supplementary 
Information), which reduces the diameter of the exposed imprints 
relative to their original diameter by about twice the drape thickness. 
The rimmed craters are well preserved, slightly elliptical in shape and 
occasionally overlap, suggesting that the rain event that formed them 
was of short duration and of light to moderate intensity, because high- 
intensity rainfall leads to distorted imprints and long-duration showers 
cause substantial overlap*’. They were probably formed in an inland 
semi-arid setting near sea level (Supplementary Information). 

The dominant contributor to imprint size is the change in raindrop 
momentum during impact’’. We define a dimensionless momentum J 
as: 


$$J = \frac{V_{\mathrm{term}}\, m_{\mathrm{d}}}{\eta\, A_{\mathrm{d}}} \qquad (3)$$


where m_d is the mass of the raindrop, η is the dynamic viscosity
(independent of ρ_air), and A_d is the cross-sectional area of the falling
drop. Obtaining atmospheric density from lithified raindrop imprints
requires: (1) measuring raindrop imprint areas; (2) determining
experimentally how J varies with imprint area by varying d and thus
m_d in equation (3) (Fig. 2); and (3) relating atmospheric density to J
(Fig. 3). Archaean imprints were measured in the field, and sub- 
sequently re-measured by high-resolution three-dimensional laser 
scanning of latex peels taken in situ. The relationship between drop 
impact momentum and corresponding imprint area was obtained 
from experiments in which we released water drops of known mass 
from an indoor height sufficient to guarantee that they reached ter- 
minal velocity onto ash substrates analogous to the Archaean tuff. One 
of the experimental ashes was fresh from the 2010 Eyjafjallajokull 
Figure 2 | Experimental relationship between raindrop area and
dimensionless momentum. The horizontal error bars are the uncertainty of
raindrop mass propagated to dimensionless momentum; the vertical error bars
express the corresponding standard deviation of crater dimensions.
with a thin veneer (about 0.5 mm) of light-toned, fine volcanic ash. Scale bar,
10 mm. c, Mildly increased imprint density on the windward (north-facing)
faces of underlying symmetrical wave-ripples. (Photo credits: W. Altermann 
for a and c, and T. Tobin for b). 


eruption in Iceland, and the other was weathered late Pleistocene 
Pahala ash from Hawaii**. Both were from mafic to intermediate hydro- 
volcanic eruptions with similar grain-size distributions to the ash host- 
ing the Archaean raindrop imprints. The relationship between air 
density and dimensionless momentum was obtained by extrapolating 
from previous work relating air density to terminal velocity’® (Sup- 
plementary Information). 
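
To make the dimensionless momentum concrete, the sketch below evaluates J = V_term m_d/(η A_d) for a single drop. It rests on assumptions that the text does not state: η is taken to be the dynamic viscosity of air near 20 °C, and A_d is taken as the circular cross-section of the equivalent sphere (a real falling drop is flattened, so its cross-section is larger).

```python
import math

RHO_WATER = 1000.0  # density of water, kg m^-3 (assumed round value)
ETA = 1.8e-5        # dynamic viscosity, Pa s (assumed: air near 20 °C)

def dimensionless_momentum(v_term, d, eta=ETA):
    """J = V_term * m_d / (eta * A_d) for a drop of equivalent diameter d (m)."""
    m_d = RHO_WATER * math.pi * d**3 / 6.0  # mass of the equivalent sphere, kg
    a_d = math.pi * d**2 / 4.0              # assumed cross-section: equivalent sphere, m^2
    return v_term * m_d / (eta * a_d)

# Largest measured raindrop quoted in the text: 6.8 mm at 9.3 m/s.
j_largest = dimensionless_momentum(v_term=9.3, d=6.8e-3)
print(f"J = {j_largest:.3g}")
```

With a spherical cross-section the expression reduces to J = 2 ρ_water V_term d/(3η), so under these assumptions J grows linearly with both drop diameter and terminal velocity.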

The actual raindrop diameter that formed the largest imprints 
found at Omdraaivlei is unknown. Smaller imprint area reflects lower 
raindrop velocity and thus higher air density. Figure 4 illustrates the 
expected atmospheric density when the raindrop size that caused the 
largest Omdraaivlei imprint is varied. To calculate an atmospheric 
density upper bound, we use the lower bound on the largest raindrop 
imprint area, because smaller imprint areas reflect lower raindrop 
terminal velocities and hence higher air density. The largest imprint 
area is bounded by A_latex − 1σ and A_in situ + 1σ, where A_in situ is the
mean maximum imprint area measured in the field, and A_latex is the
mean maximum imprint area measured from the latex peels
(Supplementary Information). Finding the corresponding air density
for these end-member dimensions for a fixed raindrop size defines the
error in air density. The air density upper bound is calculated from
A_latex − 1σ. Using a drop diameter of 6.8 mm (the size of the largest
raindrop ever measured at ground level and also the theoretical
maximum size), we obtain an absolute upper limit of less than
2.3 kg m⁻³. However, because rainfall events producing these maximal drops
are extremely rare, very intense, and highly erosive, it is more likely
that the maximum raindrop size responsible for creating the Archaean 
imprints had an equivalent diameter of 3.8-5.3 mm, depending upon 
the choice of parameterization of the raindrop size distribution and 
assuming that the probability distribution functions for rainfall rates in 
inland semi-arid settings were similar on the Archaean Earth and the 
modern Earth. These dimensions correspond to a more likely upper 
limit for atmospheric density of 0.6-1.3 kg m⁻³.

Estimates of atmospheric pressure from the ideal gas law P = ρ_air R T
require assumptions about air temperature T and, through the specific 
gas constant R, atmospheric composition. Regarding temperature 
2.7 + 0.1 billion years ago, no evidence of glaciation is present in the 
rock record. This may reflect lack of preservation, but if it is real, an 
absence of glaciation requires average temperatures to have been 20 °C 
or higher, according to data from non-glacial times in the Phanerozoic 
eon. This is also consistent with Archaean temperatures of less than
40 °C based on oxygen isotope systematics. Taking a nominal temperature
of about 20 °C, we calculated an upper limit on atmospheric
pressure by choosing a composition that maximizes R. A 100% N₂
atmosphere (R = 297 J kg⁻¹ K⁻¹, versus R = 253 J kg⁻¹ K⁻¹ for a
70% N₂ + 30% CO₂ atmosphere) constrains atmospheric pressure 2.7
billion years ago to below 0.52-1.1 atm if we take ρ_air as less than
0.6-1.3 kg m⁻³ or an absolute upper limit of less than 2.1 atm if we take
ρ_air = 2.3 kg m⁻³.
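
The conversion from density to pressure is a direct application of P = ρ_air R T. The sketch below reproduces the specific gas constants and the 0.52-1.1 atm range, assuming the quoted percentages are mole fractions, T = 293.15 K, and standard molar masses; the helper names are illustrative.

```python
R_UNIVERSAL = 8.314462618  # universal gas constant, J mol^-1 K^-1
ATM = 101325.0             # pascals per standard atmosphere
MOLAR_MASS = {"N2": 28.0134e-3, "CO2": 44.0095e-3}  # kg mol^-1

def specific_gas_constant(mole_fractions):
    """Specific gas constant R (J kg^-1 K^-1) of a mixture given mole fractions."""
    mean_molar_mass = sum(x * MOLAR_MASS[gas] for gas, x in mole_fractions.items())
    return R_UNIVERSAL / mean_molar_mass

def pressure_atm(rho_air, temperature_k, mole_fractions):
    """Ideal gas law P = rho * R * T, returned in atmospheres."""
    return rho_air * specific_gas_constant(mole_fractions) * temperature_k / ATM

pure_n2 = {"N2": 1.0}
mixed = {"N2": 0.7, "CO2": 0.3}  # assumed to be mole fractions
print(round(specific_gas_constant(pure_n2)), round(specific_gas_constant(mixed)))
for rho in (0.6, 1.3):  # likely upper limits on air density, kg m^-3
    print(f"rho_air={rho} -> {pressure_atm(rho, 293.15, pure_n2):.2f} atm")
```

Choosing pure N₂ gives the larger R, which is why it yields the most conservative (highest) pressure for a given density.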
Figure 3 | Theoretical predictions of the variation of air density with
terminal velocity and dimensionless momentum at the surface. a, The 
relationship between air density and terminal velocity, following the method of 


Our result extends quantifiable atmospheric pressure determina- 
tions beyond the modern era to the early Earth. It places constraints 
on some Archaean climate models, but does not invalidate other
proposed late-Archaean atmospheres. For models invoking high pN₂
as a means of pressure-broadening greenhouse gas absorption, only
the lowest estimate of 1.58 atm is close to our findings, suggesting a 
nitrogen content in the late-Archaean atmosphere of no more than 
twice present levels. Our result rules out very high Archaean ocean 
temperatures of 70 °C-85 °C (refs 13 and 31), because these would 
necessitate about 2-6 bar of carbon dioxide (ref. 8) plus 0.3-0.6 bar of water
vapour, increasing barometric pressure far beyond the upper limit
found here. Thus, neither strong pressure-broadening nor extreme
pCO₂ are satisfactory mechanisms for warming the early Earth illuminated
by a ‘Faint Young Sun’.
Figure 4 | Atmospheric density given the maximum raindrop diameter that
created the largest imprints at Omdraaivlei, South Africa. Squares represent
the air density calculated from the maximum imprint areas measured in situ
(ash drape removed). Circles represent the air density calculated from the
maximum imprint areas from latex measurement of imprint areas (ash drape
removed). Dashed lines represent 1σ error. With the assumption that the
rainfall rate probability distribution function responsible for the Archaean 
imprints is analogous to a modern, semi-arid, rainfall rate probability 
distribution function, there is a 78-99% probability that the maximum 
raindrop diameter was less than 3.8-5.3 mm (Supplementary Fig. 5, 
Supplementary Table 1, and Supplementary Information). 
ref. 18. b, The relationship between air density and dimensionless momentum
(see Supplementary Information). 


METHODS SUMMARY 


We measured the Ventersdorp raindrop imprints directly in the field and by 
casting them using low-viscosity latex for later laboratory study. The resulting 
latex peels captured the dimensions of 955 individual raindrop imprints. The 
topography of the peels was measured using high-resolution three-dimensional 
laser scanning. The corresponding point-clouds (available at http://gis.ess. 
washington.edu/papers/Sanjoy_Som_raindrops/) were interpolated using an 
inverse data-weighing scheme to create a digital elevation model. The digital 
elevation models were imported into a Geographical Information System and 
the dimensions of the imprints extracted. The dimensions were optimally binned, 
with the largest bin corresponding to the measurement of the largest imprints, and 
the dimension of each bin reflecting error in measurement. 
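
The interpolation step can be sketched as follows, assuming that the "inverse data-weighing scheme" refers to inverse-distance weighting. The exponent, the grid spacing and the tiny point cloud are hypothetical; the source does not specify them.

```python
import math

def idw_height(x, y, samples, power=2.0):
    """Inverse-distance-weighted height at (x, y) from (xi, yi, zi) samples."""
    num = 0.0
    den = 0.0
    for xi, yi, zi in samples:
        dist = math.hypot(x - xi, y - yi)
        if dist == 0.0:
            return zi  # query coincides with a scanned point
        weight = 1.0 / dist**power
        num += weight * zi
        den += weight
    return num / den

def idw_dem(samples, xs, ys, power=2.0):
    """Digital elevation model as a list of rows, one row per y in ys."""
    return [[idw_height(x, y, samples, power) for x in xs] for y in ys]

# Hypothetical point cloud (mm): a shallow pit at the centre of a flat surface.
cloud = [(0, 0, 0.0), (2, 0, 0.0), (0, 2, 0.0), (2, 2, 0.0), (1, 1, -1.5)]
dem = idw_dem(cloud, xs=[0, 1, 2], ys=[0, 1, 2])
for row in dem:
    print([round(z, 3) for z in row])
```

Grid nodes that coincide with scanned points return the scanned height exactly, and nodes in between are pulled towards their nearest neighbours.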

To find the relationship between raindrop imprint dimension and dimension- 
less momentum, we released droplets of different (known) mass from a height of 
27 m indoors onto analogous ash taken from Iceland and Hawaii. This height is 
more than double that required for all drops to reach terminal velocity. We could 
calculate the dimensionless momentum of all impacting raindrops because ter- 
minal velocity is predictable. The resulting imprinted ash substrates were lithified 
using hair spray and low-viscosity liquid urethane plastic. The dimensions were 
measured using the same laser scanner as that used for the latex peels. Each 
imprinted tray captured a dozen imprints originating from raindrops of the same 
mass, from which a mean and standard deviation were obtained. 

We followed published methods’? to predict theoretically from first principles 
how raindrop terminal velocity changes with air density, and thus how dimension- 
less momentum changes with air density. Given the measurement of the largest 
Ventersdorp imprint, we obtained the corresponding dimensionless momentum 
of the impacting drop using our experimental relationship. By assuming the 
dimension of the raindrop responsible for the largest imprint (bounded by the 
maximum diameter of 6.8 mm), we quantified atmospheric density (Supplemen- 
tary Information). 
Formation of the ‘Great Unconformity’ as a trigger
for the Cambrian explosion 


Shanan E. Peters & Robert R. Gaines


The transition between the Proterozoic and Phanerozoic eons, 
beginning 542 million years (Myr) ago, is distinguished by the 
diversification of multicellular animals and by their acquisition of 
mineralized skeletons during the Cambrian period’. Considerable 
progress has been made in documenting and more precisely correl- 
ating biotic patterns in the Neoproterozoic-Cambrian fossil record 
with geochemical and physical environmental perturbations” *, but 
the mechanisms responsible for those perturbations remain uncer- 
tain’. Here we use new stratigraphic and geochemical data to show 
that early Palaeozoic marine sediments deposited approximately 
540-480 Myr ago record both an expansion in the area of shallow 
epicontinental seas and anomalous patterns of chemical sedimenta- 
tion that are indicative of increased oceanic alkalinity and enhanced 
chemical weathering of continental crust. These geochemical 
conditions were caused by a protracted period of widespread 
continental denudation during the Neoproterozoic followed by 
extensive physical reworking of soil, regolith and basement rock 
during the first continental-scale marine transgression of the 
Phanerozoic. The resultant globally occurring stratigraphic surface, 
which in most regions separates continental crystalline basement 
rock from much younger Cambrian shallow marine sedimentary 
deposits, is known as the Great Unconformity®. Although Darwin 
and others have interpreted this widespread hiatus in sedimenta- 
tion on the continents as a failure of the geologic record, this palaeo- 
geomorphic surface represents a unique physical environmental 
boundary condition that affected seawater chemistry during a time 
of profound expansion of shallow marine habitats. Thus, the forma- 
tion of the Great Unconformity may have been an environmental 
trigger for the evolution of biomineralization and the ‘Cambrian 
explosion’ of ecologic and taxonomic diversity following the 
Neoproterozoic emergence of animals. 

The term Great Unconformity was first used in the year 1869 to 
describe the prominent stratigraphic surface in the Grand Canyon that 
separates the shallow marine, ~525-Myr-old Cambrian Tapeats 
Sandstone from the underlying metamorphosed, 1,740-Myr-old 
Vishnu Schist and structurally tilted sedimentary rocks of the 1,200- 
740 Myr-old Grand Canyon Supergroup®. The Great Unconformity is 
well exposed in the Grand Canyon, but this geomorphic surface, which 
records the erosion and weathering of continental crust followed by 
sediment accumulation, can be traced across Laurentia and globally, 
including Gondwana’®, Baltica’, Avalonia’ and Siberia’®, making it the 
most widely recognized and distinctive stratigraphic surface in the rock 
record. It is also notable because the Cambrian sediments that overlie it 
in many regions preserve the first skeletonized crown-group animals, a 
fact that some palaeontologists have interpreted as evidence for stra- 
tigraphic bias and an incomplete record of early animal evolution’’. 

Here we use stratigraphic and lithologic data for 21,521 rock units 
from 830 geographic locations in North America, in conjunction with 
petrologic and geochemical data (Methods; see also Supplementary 
Information), to explore the hypothesis that the formation of the 
Great Unconformity is causally linked to the evolution of biominer- 
alization; this linkage is proposed to occur by means of the geochemical 


effects of prolonged continental denudation followed by enhanced 
physical and chemical weathering of continental crust during terminal 
Ediacaran and Cambrian time. 

The Cambrian- to Early Ordovician-aged sediments of the Sauk 
Sequence’””* that overlie the Great Unconformity are time-transgressive, 
such that Early Cambrian sediments occur on the margins of the 
palaeocontinents and Late Cambrian sediments overlie the Great 
Unconformity in continental interiors (Fig. 1). The spatial extent of 
the Sauk Sequence is comparable to other Phanerozoic continent-scale 
sedimentary sequences'’”’, but its geological characteristics are 
unique. In most places, undeformed Cambrian sedimentary rocks 
deposited on Earth’s surface rest non-conformably on much older 
continental crystalline basement rocks, many of which were formed 
and/or metamorphosed within the Earth’s crust (Fig. 2a). Thus, the 
Great Unconformity marks the termination of an extended period of 
continental denudation that exhumed and exposed large areas of 
igneous and metamorphic rocks to subaerial weathering before marine 
transgression and subsequent sedimentation. 

Continental-scale marine transgression during the Cambrian—Early 
Ordovician accentuated rates of weathering on the Great Unconformity 
by shifting landward the position of the erosive transgressive shoreface 
system, often called the ‘wave-base razor’, as well as adjacent transitional
backshore, aeolian and fluvial systems. As a result, much of the
soil and weathered basement rock (regolith) that covered low-relief 
continental interiors’ was eroded and mobilized during the transgres- 
sion, thereby exposing silicate mineral surfaces to weathering over an 
area that is unprecedented in the rock record (Fig. 2a). This is import- 
ant because freshly exposed rock weathers chemically at rates more 
than three times faster than undisturbed soils and regolith’*', and 
Figure 1 | Sauk Sequence in North America. Distribution and age of the
oldest Phanerozoic sedimentary rocks in North America. 
Figure 2 | Phanerozoic sedimentation patterns in North America.
a, Minimum area of crystalline basement rock exposed at surface before burial
by sediments in each epoch. b, Estimated minimum total carbonate burial flux 
in each epoch, based on lithostratigraphic units that are composed primarily of 
carbonate (Fig. 3c). c, Proportion of glauconite-rich siliciclastic sedimentary 
rock units (Fig. 3d). Grey bars span the Cambrian—Early Ordovician Sauk 
Sequence. Error bars, +1 standard error. O, Ordovician; S, Silurian; D, 
Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; 
Pg, Palaeogene; Ng, Neogene. 


because the chemical weathering of basement-derived silicate minerals 
in soils and regolith is enhanced by physical reworking’. The sharp 
stratigraphic contact that typically separates Precambrian basement 
from the quartz- and feldspar-rich marine sands of the basal, time- 
transgressive Sauk Sequence (Fig. 3e; see Supplementary Information) 
demonstrates the efficacy of transgressive shoreface systems in stripping 
soils and regolith. Shorter-duration eustatic sea level falls, superimposed 
on the longer-term Sauk transgression, as well as at the terminal 
Ediacaran’, provided an additional mechanism for subaerial exposure, 
reworking and chemical weathering of basement- and regolith-derived 
sediments. 

Exposure and chemical weathering of silicate minerals derived from 
continental basement rocks during the formation of the Great 
Unconformity exerted an important control on seawater chemistry 
and global biogeochemical cycling in several ways, including the
consumption of atmospheric CO₂ and the release of HCO₃⁻, Ca²⁺,
H₃SiO₄⁻, SO₄²⁻, Cl⁻, Na⁺, Mg²⁺, K⁺, Fe²⁺ and other ions to the
oceans. We find sedimentological evidence for an increased
reservoir size and a sustained flux of continental weathering products 
to the ocean within each of the three primary facies belts that onlap 
Cambrian palaeocontinents’®: the nearshore, generally coarse-grained 
inner detrital belt, the middle carbonate platform belt, and the outer, 
fine-grained detrital belt (Fig. 3). 

The signal of enhanced continental crustal weathering is perhaps 
most conspicuously expressed by precipitation of carbonate sedi- 
ments, which reached a Phanerozoic peak in shelf burial flux during 
the Sauk transgression (Figs 2b, 3c)'”’*. In Laurentia, the large quantity 
of Cambrian-Early Ordovician carbonates is known as the ‘Great 
American Bank’. Precipitation of carbonates is an important sink 
for alkalinity that is derived from chemical weathering, and we find 
that the area of continental basement exposed before sedimentation in 
each time interval (Fig. 2a) is predictive of changes in carbonate burial 
rates throughout the Phanerozoic (Fig. 2b; partial linear product- 
moment correlation between first differences, with effect of varying 
interval duration held constant, is 0.74, P=5 X 10°). These results 
are consistent with a recent model of the Cambrian carbon cycle, 
which demonstrated that unusually large absolute rates of carbon 
throughput are required to explain global carbon isotopic excursions’. 
In the absence of a deep-sea sink for carbonate, which did not become 
established until the Mesozoic origin of calcareous plankton'*”, 
Cambrian carbonate burial rates were limited by the rate at which shelf 
accommodation space was created by sea level rise during the Sauk 
transgression and by lithospheric subsidence. Thus, it is likely that in 
the earliest Cambrian ocean there was both a large pre-existing 
reservoir of chemical weathering products and an enhanced flux 
originating from the weathering of continental crust. 
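
The statistic quoted above (a partial correlation between first differences, with interval duration held constant) can be sketched as follows. This is one possible reading: the control variable is taken here as the first difference of interval duration, and the three short series are hypothetical placeholders, not the Macrostrat data.

```python
import math

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def first_differences(series):
    """Change between each pair of successive time intervals."""
    return [b - a for a, b in zip(series, series[1:])]

def partial_correlation(x, y, z):
    """Correlation of x and y with the linear effect of z held constant."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz**2) * (1 - ryz**2))

# Hypothetical per-epoch series (arbitrary units), for illustration only.
basement_area = [5.0, 9.0, 4.0, 3.0, 6.0, 2.0, 1.0]
carbonate_flux = [2.0, 4.5, 2.5, 1.0, 3.5, 1.5, 0.5]
duration = [12.0, 20.0, 15.0, 10.0, 18.0, 16.0, 9.0]

r = partial_correlation(first_differences(basement_area),
                        first_differences(carbonate_flux),
                        first_differences(duration))
print(f"partial correlation of first differences: {r:.2f}")
```

Differencing removes shared long-term trends, so the statistic asks whether epoch-to-epoch changes in exposed basement area track changes in carbonate burial beyond what unequal interval lengths alone would produce.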

Additional evidence for an unusually high flux of continental 
weathering products to the Cambrian ocean, and a corresponding 
increase in the extent of carbonate precipitation, is provided by 
petrographic and geochemical data from outer detrital belt 
mudstones”. Although the Phanerozoic rock record is replete with 
carbonate cemented mudstones, most cements form as a result of 
microbially mediated remineralization of organic matter and, therefore, 
have δ¹³C values that are significantly depleted relative to sea water.
Outer detrital belt mudstones of the Sauk Sequence, by contrast, not 
only contain a significantly greater wt% carbonate than composite shale 
standards, but they also yield δ¹³C values that are consistent with direct
precipitation from sea water (see Supplementary Information). 
Petrographic analysis reveals that carbonate cements have textures 
indicative of rapid, displacive growth at bed tops, near the sediment- 
water interface” (Fig. 3b) and that detrital micrite is rare or absent. 
Furthermore, many outer detrital belt mudstones contain carbonate 
concretions and/or mud mounds with δ¹³C values indicative of precipitation
from seawater-derived bicarbonate (see Supplementary
Information). 

The signal of enhanced weathering of continental crust during the 
Sauk transgression is evidenced in the inner detrital belt by a peak in 
the frequency of glauconite-rich siliciclastic rock units (Figs 2c, 3d). 
Glauconite, (K,Na)(Fe³⁺,Al,Mg)₂(Si,Al)₄O₁₀(OH)₂, is a phyllosilicate
mineral that in modern marine environments forms at depths of
>50 m and in regions where sedimentation rates are low and prolonged
periods of pore-water exchange with seawater occur.
However, it has been noted that the distribution and abundance of 
glauconite in Cambrian marine sediments (Fig. 2c) required rapid 
authigenesis under chemical conditions that are different from those 
found on modern marine shelves”. We attribute this anomaly in 
glauconite formation to an unusually large flux of continental chemical 
weathering products, particularly K⁺, Fe²⁺ and H₃SiO₄⁻, during
the formation of the Great Unconformity. Restricted circulation and 
low net rates of sedimentation within the inner detrital belt may 
have also contributed to the unusual distribution and abundance of 
glauconite. 
Figure 3 | Middle to Late Cambrian palaeoenvironments and
sedimentology. a, Generalized marine shelf profile. b, Cathodoluminescence
photomicrograph of the Kaili Formation, an outer detrital belt mudstone from 
south China. Lamination tops are defined by authigenic carbonate cements that 
glow orange under cathodoluminescence; clay-rich lamina bases are dark. 

c, Exposure of Notch Peak Formation in Utah. d, Plane-polarized light 


Other records of seawater chemistry provide supporting evidence 
for changes in tectonic activity and enhanced continental weathering 
during the formation of the Great Unconformity. For example, the 
average ⁸⁷Sr/⁸⁶Sr of sea water increased during the Neoproterozoic
(Fig. 4). This long-term signal is attributable to the increasing concentration
of ⁸⁷Sr in continental crust due to the decay of ⁸⁷Rb, and to the
long-term erosion and progressive exposure of ⁸⁷Sr-rich granitic rocks
required to form the Great Unconformity. During the Cambrian,
⁸⁷Sr/⁸⁶Sr increased more rapidly to achieve a 900 Myr maximum (that
is, a maximum over the past 900 Myr) near the end of the Sauk
transgression, a signal we attribute to enhanced weathering of continental
crust during the Sauk transgression. Although Phanerozoic ⁸⁷Sr/⁸⁶Sr
values exhibit many shorter-term oscillations, average ⁸⁷Sr/⁸⁶Sr values
declined after the Late Cambrian peak for most of the Phanerozoic
(Fig. 4). We hypothesize that this reversal in the long-term trend in
seawater ⁸⁷Sr/⁸⁶Sr was driven by progressive Phanerozoic sedimentary
re-covering of ⁸⁷Sr-rich continental crust and by increased contributions
of ⁸⁶Sr from the hydrothermal alteration of mid-ocean-ridge
basalt. The 900 Myr Cambrian peak in ⁸⁷Sr/⁸⁶Sr was not approached
again until the recent, possibly indicating enhanced removal of 


photomicrograph of a carbonate-cemented, glauconite-bearing sandstone 
from the Au Train Formation, inner detrital belt, northern Michigan. e, Great 
Unconformity in Wind River Canyon, Wyoming, with ~510-Myr-old marine
Flathead Sandstone on 2,900-Myr-old granitic basement. Coin is US dime 
(17.9 mm diameter). gl, glauconite; cal, calcite; qtz, quartz. 


Phanerozoic sedimentary cover during Cenozoic orogenesis and glo- 
bal climate change”. 

Congruent evidence for enhanced weathering of continental crust 
during the Sauk transgression is provided by εNd, an isotopic tracer
with a short oceanic residence time that is sensitive to the mean age of
continental crust undergoing chemical weathering. Average seawater
εNd declined through the Cambrian to reach a long-term minimum at
the peak of the Sauk transgression (Fig. 4), reflecting increasingly
important input from ancient continental basement rocks as the Sauk 
transgression progressed from relatively young continental margins 
into generally older cratonic interiors. 

Perhaps the strongest evidence for an increased reservoir and flux of 
continental chemical weathering products to the global ocean is provided
by direct measurements of [Ca²⁺] in evaporite fluid inclusions.
Concentrations of Ca²⁺ in sea water rose precipitously from the
Neoproterozoic to reach a Phanerozoic high in the Cambrian. We
attribute much of this approximately threefold increase in seawater
[Ca²⁺] to enhanced chemical weathering of continental crust during
the Sauk transgression. Elevated seawater [Ca²⁺] has been proposed as a
mechanism for the origin of biocalcification by promoting intracellular 


[Figure 4 graphic: global ⁸⁷Sr/⁸⁶Sr (axis ticks 0.7060 to 0.7090) and global mean εNd plotted against geologic time (900 to 0 Myr ago), with annotations for Rodinia, Rodinia breakup, Pangea, Pangea breakup, recent mountain building, continental denudation, continental sediment accumulation, glaciations and ~56 Myr sedimentation cycles.]


Figure 4 | Summary of major tectonic, geochemical and sedimentary 
patterns over the past 900 Myr. The shift from widespread continental 
denudation to widespread sedimentation on the continents defines the Great 
Unconformity. ⁸⁷Sr/⁸⁶Sr estimates (dark grey envelope) from ref. 2 and references
therein; εNd estimates (light grey envelope) from ref. 26. Vertical blue bars
indicate times of major global glaciations; vertical green bars identify continent- 


scale sedimentary sequences, including the Cambrian-Early Ordovician Sauk
Sequence (dark green bar). Approximate times of widespread continental 
denudation are identified by horizontal red bars at top; time of widespread 
sediment accumulation on continents is identified by partially overlapping light 
yellow bar. Periods of major supercontinents (Rodinia, Pangea) and their break- 
up are identified, including recent mountain building (mtns.). 
Ca²⁺ concentrations that were toxic to some animals. Here we
provide a mechanism for this increase in seawater [Ca²⁺], as well as
other ions involved in biomineralization. 

The palaeogeomorphic surface represented by the Great 
Unconformity is unique in at least the past 900 Myr of Earth history, 
but similar episodes of widespread continental denudation followed by 
large-scale marine transgression and sedimentation may have 
occurred before 900 Myr ago. However, the advent of biomineraliza- 
tion and the Cambrian explosion required the presence of animals, 
which evolved in Cryogenian to Ediacaran time. Our results
therefore offer a new hypothesis for the timing and origin of
biomineralization and the Cambrian explosion, both of which lag the
initial origin of bilaterian animals by tens of millions of years. Specifically,
we propose that biomineralization in multiple clades occurred in
response to the changes in global ocean chemistry that were promoted
by the formation of the Great Unconformity and by the expansion of
shallow marine environments that it records. Although Darwin and
other palaeontologists have regarded the resultant widespread hiatus
in the rock record as a failure of preservation, the formation of this
prominent gap may have actually been an environmental trigger for
biomineralization, thereby promoting the Cambrian explosion of
marine animals. Determining the geodynamic causes of extensive
Neoproterozoic continental denudation followed by Phanerozoic
sedimentation, and linking those dynamics to the timing and spatial
distribution of marine transgression and biogeochemical change, is now a
challenge for geoscience.


METHODS SUMMARY 


Stratigraphic data derive from Macrostrat (http://macrostrat.org), which consists 
of 21,521 rock units distributed among 830 geographic regions in North 
America. Geological ages for the top and bottom of each Phanerozoic rock unit
are referenced to one of approximately 82 stage-level time intervals. For this 
analysis, data were binned into 32 Phanerozoic epochs (median duration 
15.6 Myr; see Supplementary Information). 

Macrostrat is comprehensive in the sense that all rock units in each region, 
including those in the subsurface, are included. Rock units have an explicitly 
identified thickness range and dominant lithology in each region. Many units also 
have secondary lithologies and, when prominent and characteristic of the unit, 
additional data on sedimentology, such as mineralogy and sedimentary structures. 

To estimate the total number of sedimentary rock packages that directly overlie 
crystalline basement, all of the rock units older than the base of each Phanerozoic 
sediment package were tabulated in each region. If there were no older sedimentary 
rocks at a location, then that package was identified as resting on basement. This 
approach is conservative with respect to the Great Unconformity because a large 
fraction of Precambrian sedimentary rock is structurally deformed and laterally 
adjacent to basement rocks. Thus, the Great Unconformity often cuts across all 
older rock units at each location.
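
The tabulation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not Macrostrat's own implementation: the `RockUnit` record, its field names and the toy columns for regions "A" and "B" are hypothetical, and "older" is taken to mean that a unit's top is at least as old as the base of the package in question.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RockUnit:
    region: str          # hypothetical region identifier
    name: str
    base_age_myr: float  # age of the unit's base, in Myr ago
    top_age_myr: float   # age of the unit's top, in Myr ago
    sedimentary: bool    # False for crystalline basement


def rests_on_basement(package: RockUnit, units: list[RockUnit]) -> bool:
    """Return True if the region holds no sedimentary unit older than the package."""
    for unit in units:
        if unit is package or unit.region != package.region:
            continue
        # A unit counts as older when its top predates the base of the package.
        if unit.sedimentary and unit.top_age_myr >= package.base_age_myr:
            return False
    return True


# Toy columns (hypothetical ages): region A lacks Precambrian sediment, region B has some.
units = [
    RockUnit("A", "crystalline basement", 1800.0, 1400.0, False),
    RockUnit("A", "Cambrian sandstone", 510.0, 500.0, True),
    RockUnit("B", "crystalline basement", 1700.0, 1100.0, False),
    RockUnit("B", "Proterozoic shale", 1000.0, 900.0, True),
    RockUnit("B", "Cambrian sandstone", 505.0, 495.0, True),
]

on_basement_a = rests_on_basement(units[1], units)
on_basement_b = rests_on_basement(units[4], units)
```

In this toy case the Cambrian package in region A is flagged as resting on basement, whereas the one in region B is not, because an older sedimentary unit lies beneath it.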

Carbon and oxygen isotopic compositions of 220 outer detrital belt mudstone 
samples from North America and China were obtained by dissolving whole rock 
powders in phosphoric acid and analysing evolved CO₂ at the stable isotope
laboratories at the University of California Davis and Riverside. Standard errors
for all measurements were ≤0.2‰. Whole rock geochemistry measurements from
82 outer detrital belt mudstone samples in North America and China were 
obtained from ultrasonically cleaned, twice-fused polished glass beads using a 
Panalytical Axios XRF instrument at Pomona College (see Supplementary 
Information). 
Clusters of iron-rich cells in the upper beak of pigeons
are macrophages not magnetosensitive neurons 


Christoph Daniel Treiber, Marion Claudia Salzer, Johannes Riegler, Nathaniel Edelman, Cristina Sugar, Martin Breuss,
Paul Pichler, Herve Cadiou, Martin Saunders, Mark Lythgoe, Jeremy Shaw & David Anthony Keays


Understanding the molecular and cellular mechanisms that mediate 
magnetosensation in vertebrates is a formidable scientific problem.
One hypothesis is that magnetic information is transduced into
neuronal impulses by using a magnetite-based magnetoreceptor.
Previous studies claim to have identified a magnetic sense system in
the pigeon, common to avian species, which consists of magnetite-
containing trigeminal afferents located at six specific loci in the
rostral subepidermis of the beak. These studies have been widely
accepted in the field and heavily relied upon by both behavioural
biologists and physicists. Here we show that clusters of iron-rich
cells in the rostro-medial upper beak of the pigeon Columba livia are
macrophages, not magnetosensitive neurons. Our systematic 
characterization of the pigeon upper beak identified iron-rich cells 
in the stratum laxum of the subepidermis, the basal region of the 
respiratory epithelium and the apex of feather follicles. Using a three- 
dimensional blueprint of the pigeon beak created by magnetic res- 
onance imaging and computed tomography, we mapped the location 
of iron-rich cells, revealing unexpected variation in their distribution 
and number—an observation that is inconsistent with a role in mag- 
netic sensation. Ultrastructure analysis of these cells, which are not 
unique to the beak, showed that their subcellular architecture 
includes ferritin-like granules, siderosomes, haemosiderin and 
filopodia, characteristics of iron-rich macrophages. Our conclusion 
that these cells are macrophages and not magnetosensitive neurons is 
supported by immunohistological studies showing co-localization 
with the antigen-presenting molecule major histocompatibility com- 
plex class II. Our work necessitates a renewed search for the true 
magnetite-dependent magnetoreceptor in birds. 

Each year millions of birds complete lengthy journeys guided by the 
Earth’s magnetic field. Current evidence indicates that the detection of 
magnetic fields is mediated by an inclination-sensitive light-dependent 
compass that resides in the retina, and an intensity-sensitive
apparatus that is believed to provide information about the magnetic
map, is associated with the trigeminal nerve, and is thought to rely on
biogenic magnetite (Fe₃O₄). The trigeminal nerve was first
implicated in magnetoreception in the bobolink Dolichonyx oryzivorus,
and was suggested to be sensitive to small alterations in magnetic
stimuli. Subsequent studies revealed that the ophthalmic branch is
required for pigeons to perform an intensity-based conditioning task,
and that neurons in the trigeminal brainstem complex of European
robins are activated when the birds are subjected to non-uniform
magnetic fields. These data have led to the proposition that the
sensory cells responsible for magnetite-based magnetoreception lie
in the upper beak of birds. Previous studies have claimed that
clusters of iron-containing neurons in six specific bilateral locations
in the rostral dermis of the upper beak of pigeons constitute a magnetic
sense system. It has been contended that this system consists of
unmyelinated dendrites that contain superparamagnetic spherules
surrounded by iron platelets that are composed of magnetite and
maghemite, and that the system is a common sensory apparatus in
birds. These assertions have formed the basis for a host of
behavioural studies and theoretical calculations that aim to advance the
magnetite theory of magnetoreception.

To investigate this putative magnetic sense system, we undertook a 
systematic analysis of the prevalence and distribution of all iron-rich 
cells in the upper beak of the pigeon. We perfused adult pigeons 
(Nuremberg cohort, n = 12), and sectioned the upper beak from the 
caudal respiratory concha to the tip of the beak in the coronal plane 
(Fig. 1a, b). We stained serial sections (10 µm) with Prussian blue (PB)
to label ferric iron, and nuclear fast red (NFR) to identify nuclei, 
followed by counting of all PB-positive cells. We consistently observed 
PB-positive cells in three specific regions: (1) in the stratum laxum of 
the dorsal and/or ventral subepidermis; (2) in the buds of feather 
follicles; and (3) in the basal region of the respiratory epithelium 
(Fig. 1c-e). We confirmed this pattern of staining in a larger collection 
of pigeons originating from seven different lofts (n = 172). PB-positive 
cells in all three regions were characterized by the presence of multiple 
dark blue spherules (0.25-5.0 µm in size) and/or by light blue cytoplasmic
staining with a notable nucleus (Fig. 1g-k and Supplementary Figs 1-3). 
Subepidermal PB-positive cells in caudal and medial regions were pre- 
dominantly found in the dorsal subepidermis (Fig. 1c), whereas those 
PB-positive cells located rostrally were found in the ventral subepidermis 
lining the inner roof of the beak (Fig. 1f and Supplementary Fig. 1). PB- 
positive cells in the feather follicle clustered in the apical region of the 
bud (Fig. 1d and Supplementary Fig. 2), and those in the respiratory 
epithelium were predominantly found within the lateral edges of the 
concha (Fig. 1e and Supplementary Fig. 3).

As it is believed that iron-rich cells in the upper pigeon beak are 
limited to six discrete bilateral anatomical loci, we mapped the
distribution of PB-positive cells along the rostro-caudal axis of the beak. To
do this accurately we first created a three-dimensional topographic 
map of the pigeon beak by undertaking high-resolution magnetic 
resonance imaging (MRI) coupled with micro-computed tomography 
(micro-CT) scanning, identifying four specific anatomical landmarks 
(Supplementary Movies 1, 2 and Supplementary Figs 4, 5). After stain- 
ing serial sections, we counted PB-positive cells and used our land- 
marks to map the distribution of cells along the rostro-caudal axis 
(n = 12). We found that PB-positive cells in the respiratory epithelium 
and feather follicles were restricted to caudal regions, whereas those in 
the subepidermal region were found in clusters along the length of the 
beak with no apparent bilateralization (Fig. 2a-c, Supplementary
Figs 6, 7 and Supplementary Table 1). We found no significant
differences in the total number of PB-positive cells between sexes
(respiratory epithelium (P > 0.5), subepidermis (P > 0.1), feather
follicle (P > 0.1)) (Supplementary Fig. 8), but observed an extremely
large variation in the number and distribution of PB-positive cells 
when comparing birds of the same age and sex. For instance, pigeon 
200 had ~200 PB-positive cells in the subepidermis, whereas pigeon 
Figure 1 | PB-positive cells in the upper beak. a, Schematic showing the head
of an adult pigeon. The area shaded red indicates the region of the beak that was
sectioned, stained and screened for PB-positive cells. br, brain; ce, cere; co, 
concha; oe, olfactory epithelium. b, Upper beak coronal section stained with PB 
and NFR (the rostro-caudal location of the section is indicated with a red line in 
a). Scale bar, 2 mm. c, Schematic of a coronal section highlighting the non- 
uniform distribution of PB-positive cells in the subepidermal region in four 
different birds. d, e, Schematic of a feather follicle and concha showing the 
location of PB-positive cells in four birds. f, Schematic of a coronal section 
rostral to the cere, showing the non-uniform distribution of PB-positive cells in 
the subepidermis in three different birds. g, Quantification of NFR staining in 
PB-positive cells revealing that at least 97% of cells in all regions are nucleated 
(n = 5 birds, n = 180 cells). d, dorsal; v, ventral. Error bars show the standard
deviation. h-k, Representative PB-positive cells in which the nuclei are visible 
(cell membranes are highlighted by a dashed line, nuclei are marked with n). See 
also Supplementary Figs 1-3. Scale bar, 1 µm.


203 had ~108,800 PB-positive cells located in numerous clusters along
the length of the beak. Although our serial quantification samples a
section every 120 µm, we did not find the six 350-µm-long bilateral
clusters that are claimed to constitute a magnetic sense system. We
speculated that our pigeon strain might harbour a large genomic dele- 
tion that accounts for the absence of this putative magnetosensitive 
system. To investigate this we quantified PB-positive cells in pigeons 
from another loft (Vienna cohort, n = 6). Similar to our Nuremberg 
cohort we did not find six bilateral clusters, and once again observed a 
large variation in the distribution and number of PB-positive cells 
(Supplementary Fig. 9 and Supplementary Table 2). This variation is 
not consistent with a genetically encoded sensory apparatus responsible 
for magnetosensation. 

Next we asked whether PB-positive cells are neurons by triple stain- 
ing sections with PB, NFR, and one of three different antibodies that 
label neuronal structures: neurofilament (NF), TUBB3 and MAP1B 
(n = 5 birds). In the respiratory epithelium we observed 0.04%
co-localization with NF (n = 1,208 cells), 0.6% co-localization with
TUBB3 (n = 2,818 cells), and 0.01% co-localization with MAP1B
(n = 2,213 cells). In the subepidermis we found no co-localization with
NF (n = 471 cells) or MAP1B (n = 803 cells), and only 0.06%
co-localization with TUBB3 (n = 1,309 cells). Finally, in the feather follicle
we found no co-localization with NF (n = 286 cells) or MAP1B
(n = 295 cells), and only 0.24% co-localization with TUBB3 (n = 407
cells) (Supplementary Figs 10, 11). The simplest explanation for the


368 | NATURE | VOL 484 | 19 APRIL 2012


[Figure 2 plots, panels a-c. Vertical axis: number of PB-positive cells per 500 µm increment (logarithmic). Horizontal axis: increment number (1-40).]

Figure 2 | Number and distribution of PB-positive cells in the upper beak. 
a-c, Plots showing the number of PB-positive cells per 500 µm increment along
the length of the beak in the respiratory epithelium (blue dots)
(a), subepidermis (red dots) (b) and the feather follicle (green dots) (c; n = 12
birds). Each dot represents the number of PB-positive cells in an individual bird
in that increment, with males (n = 6) shown in dark colours and females
(n = 6) in light colours. PB-positive cells in the feather follicle and respiratory
epithelium are predominantly found in caudal regions, whereas cells in the 
subepidermis are found along the length of the beak but are most prevalent 
beneath the cere. Total cell numbers per increment are plotted on a logarithmic 
axis, highlighting the large variation between birds. Individual birds are shown 
in Supplementary Figs 6 and 7. 


very small amount of apparent co-localization we observed is that two 
cells, one PB positive and the other positive for a neuronal marker, lie in 
the same vertical plane, and because of the nature of the chemical stain 
used cannot be distinguished from one another. Taken together, our 
results strongly suggest that the clusters of PB-positive cells in the beak 
of the pigeon are not neurons. 

To ascertain the true identity of the PB-positive cells, we undertook 
an analysis of their ultrastructure using transmission electron
microscopy (TEM) (n = 3 birds) (Supplementary Fig. 12). We observed
ferritin-like granules (6-9 nm) throughout the cytoplasm of
PB-positive cells from all regions and in some instances haemosiderin
masses and/or membrane-bound electron-dense organelles known as
siderosomes (~300 nm) (Fig. 3a-f and Supplementary Fig. 13).
Energy-filtered transmission electron microscopy (EFTEM) confirmed 
that each electron-dense granule was composed of iron (Supplementary 
Figure 3 | Ultrastructure of PB-positive cells. a-f, Representative electron
micrographs of two different PB-positive cells in the respiratory epithelium
(a, b), subepidermal region (c, d) and the feather follicle (e, f; n = 3 birds). Cells in
all regions were found to contain ferritin-like granules (6-9 nm in diameter) that
are present throughout the cytoplasm. In addition we observed haemosiderin (he)
clumps and/or membrane-bound siderosomes (si). Osmophilic lipid droplets
(os) are visible in cells in the respiratory epithelium in b. Cells in the subepidermal
region and feather follicle were noted for their slender cytoplasmic projections
resembling filopodia, which are seen to engulf a cell in d. Cells in all regions were
nucleated (nu). See also Supplementary Fig. 13. Scale bars, 1 µm.


Fig. 14). Selected area electron diffraction (SAED) failed to identify 
any cellular structures that contained magnetite, but showed that 
haemosiderin masses in the feather follicle consist of a goethite-like 
material (n = 3 birds), whereas siderosomes in the respiratory
epithelium are composed of ferrihydrite (n = 2 birds) (Supplementary
Fig. 15 and Supplementary Table 3). On a cellular level,
PB-positive cells in the respiratory epithelium are characterized by the
presence of osmophilic lipid vacuoles (Fig. 3b), whereas those cells 
originating from the feather follicle and subepidermal region had 
notable dendritic extensions that resembled filopodia (Supplemen- 
tary Fig. 13d-i, k). In some instances these cytoplasmic tentacles 
appeared to engulf neighbouring cells, suggesting to us that the PB 
cells may be phagocytic macrophages (Fig. 3d). 

Macrophages are known to reside in the spleen, dermis and respiratory 
mucosa of multiple species, and to have a vital role in host defence and 
iron homeostasis. Iron accumulates within macrophages during
the catabolism of haemoglobin and is stored as ferritin. In one class
of macrophages known as siderophages, ferritin accumulates in
membrane-bound siderosomes, which can then undergo proteolytic
processing to form haemosiderin. This accumulation of iron renders
these cells PB positive. To ascertain whether the PB-positive cells in
the upper beak of the pigeon were siderophages, we stained
cryosections with sera against major histocompatibility complex class II (MHC
II), which labels antigen-presenting cells including macrophages,
alongside positive and negative controls (n = 4 birds) (Fig. 4a-c and
Supplementary Fig. 11). We observed MHC II co-localization with
98.8% of PB-positive cells in the respiratory epithelium (n = 104 cells), 
Figure 4 | MHC II immunohistochemistry on PB-positive cells.
a-c, Representative images of coronal sections triple stained with PB, NFR, and
sera against the antigen-presenting marker MHC II (n = 4 birds). MHC II
staining was predominant on the surface of cells and found to co-localize
with ≥94% of PB-positive cells in all regions. Controls are shown in
Supplementary Fig. 11. Scale bar, 10 µm.


95% of PB-positive cells in the subepidermis (n = 92 cells), and 94.4% 
in the feather follicle (n = 205 cells). Taken together with our anatom- 
ical mapping, subcellular data and neuronal staining, we conclude that 
clusters of PB-positive cells in the upper beak of the pigeon C. livia are 
macrophages, not magnetosensitive neurons. 

As macrophages are not unique to the upper beak, our finding 
predicts that PB-positive cells should be found throughout the pigeon. 
To test this we stained skin samples from the back, abdomen, neck, 
scalp, wing and lower beak of the pigeon (n = 3 birds). This revealed 
widespread PB-positive staining in the subepidermis and feather 
follicle, which was indistinguishable from that observed in the upper 
beak (Supplementary Fig. 16). Our conclusion further predicts the 
infiltration of PB-positive macrophages in response to tissue damage 
or host invasion. We observed such a response in the beak of one of our 
pigeons (P199), where a large inflammatory lesion with a necrotic 
centre was surrounded by lymphoplasmacytic cells, including 
~80,000 PB-positive cells that were again characterized by constella- 
tions of blue spherules and/or by light blue cytoplasmic staining 
(Supplementary Fig. 17). 

Although we cannot exclude the possibility that a small number of 
sparsely distributed magnetoreceptors reside at an unknown location 
in the upper beak of pigeons, this study finds no evidence to support 
the existence of a subepidermal magnetic sense system that consists of 
iron-containing dendrites at six specific bilateral loci. This conclusion, 
which is supported by a critical analysis of the elemental composition 
of PB-positive cells in the subepidermis, has several important
implications. First, it requires a re-evaluation of behavioural studies that
have purported to impair the function of a magnetite-based receptor in
the subepidermis of the upper beak and the conclusions that these
studies reached. Second, it necessitates a re-assessment as to whether
superparamagnetic magnetite has the necessary physical and magnetic
properties to act as a magnetosensor in a living system. Third, our
work reveals that the sensory cells that are responsible for trigeminally
mediated magnetic sensation in birds remain undiscovered. These
enigmatic cells may reside in the olfactory epithelium, a sensory
structure that has been implicated in magnetoreception in the rainbow
trout.

METHODS SUMMARY 


Histological studies. We perfused adult pigeons with 4% phosphate-buffered 
paraformaldehyde (PFA, pH 7.4), and dissected the tissue with ceramic-coated 
tools. We embedded the tissue in paraffin and prepared 10-µm coronal sections.
We stained sections for 20 min in 5% potassium hexacyanoferrate with 10% HCl, 
followed by a series of washes and a 2 min exposure to NFR. For immunohisto- 
chemistry we incubated the sections with the primary antibody for 18h in 0.1% 
Triton PBS with 2-4% milk (pH 7.4), before detection using standard methods, 
followed by PB staining. 

Ultrastructure studies. Adult pigeons were perfused with 2.5% glutaraldehyde 
supplemented with 2% PFA in PBS (pH 7.4). Tissue was dissected with
ceramic-coated tools, incubated for 1 h in phosphate-buffered 2% osmium (pH 7.4),
dehydrated and embedded in epoxy resin. We prepared alternating semi-thin
(2 µm) and ultra-thin (70 nm) sections. Semi-thin sections were stained with
PB, and the ultra-thin sections were used for TEM, EFTEM and SAED.
Imaging studies. Adult pigeons were perfused with 4% PFA. Following
post-fixation and mounting, MRI was performed on a horizontal bore 9.4 T
DirectDrive VNMRS system (Agilent Technologies), and CT on a Nucline Nano 
SPECT/CT imaging system (Mediso). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


PB staining. We optimized our PB-staining protocol on our large cohort of 
pigeons (n = 172) that we sourced from seven different lofts. We experimented 
with a variety of different fixatives, section thickness, and blades before adopting 
the following protocol. Adult homing pigeons (n = 12 Nuremberg cohort, n = 6 
Vienna cohort) were killed before perfusion with 4% phosphate-buffered 
paraformaldehyde (PFA, pH 7.4). The tissue was dissected with ceramic-coated 
tools and post-fixed for 18h before dehydration in an increasing alcohol series. 
The tissue was then embedded in paraffin and sectioned coronally (10 µm) with
ceramic-coated microtome blades (DuraEdge High Profile, BLM00103P). Four
consecutive coronal sections were mounted on positively charged microscope 
slides (Menzel Superfrost PLUS, Thermo Scientific). For PB staining we incubated 
every third slide in a freshly prepared solution of 5% potassium hexacyanoferrate 
(Sigma, P9387) in 10% HCl for 20 min. After three washes in double distilled H₂O,
sections were counterstained for 2 min in NFR (Sigma, 60700). Each slide was then 
scanned (MIRAX Slide Scanner) and PB-positive cells counted manually, using 
light microscopy where necessary. For anatomical mapping, landmarks were 
identified (Supplementary Fig. 5), and 500-µm-thick increments determined.
To obtain estimated total cell counts, the number of PB-positive cells counted
within a normalized increment was divided by the number of sections counted
within that increment and multiplied by a factor of 50 (see Supplementary Tables 1 
and 2). We compared the number of PB-positive cells in males and females by 
performing a Student’s t-test. All pigeons were sexed using genetic methods as 
previously described, and experiments were performed in accordance with the
relevant guidelines and regulations (Magistrat 60, Veterinäramt, MA60-001603/
2010/002). 
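
The scaling step above can be written out as a short sketch. It assumes, as our reading rather than something the text states, that the factor of 50 is the number of 10-µm sections that fit in one 500-µm increment; the function name and the example counts are hypothetical.

```python
SECTION_THICKNESS_UM = 10   # section thickness given in the Methods
INCREMENT_LENGTH_UM = 500   # increment length given in the Methods


def estimate_increment_total(cells_counted: int, sections_counted: int) -> float:
    """Scale the mean count per sampled section up to a whole increment."""
    if sections_counted <= 0:
        raise ValueError("at least one section must have been counted")
    # Assumed origin of the factor of 50: 500 µm / 10 µm per section.
    sections_per_increment = INCREMENT_LENGTH_UM / SECTION_THICKNESS_UM
    return cells_counted / sections_counted * sections_per_increment


# Hypothetical example: 24 PB-positive cells seen across 4 counted sections.
example_total = estimate_increment_total(24, 4)
```

With these made-up numbers the mean is 6 cells per section, which scales to an estimated 300 cells for the increment.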

Immunohistochemistry. For staining with neuronal antibodies slides were de- 
paraffinized and washed in PBS (pH 7.4) before incubation with the primary 
antibody for 18h in 0.1% Triton PBS with 2% milk (pH 7.4). Primary antibodies 
were used at the following concentrations: NF (Millipore, MAB1621, 1:2,000), 
TUBB3 (Covance, MMS-435P, 1:1,000), MAP1B (Santa Cruz, SC-58784, 1:75). 
Following a series of washes in PBS, slides were incubated for 2 h with a biotinylated 
secondary antibody (1:500), before visualization with a peroxidase-based 
Vectastain Elite ABC kit (Vector Labs, PK-4002) and the chromogen DAB
(3,3′-diaminobenzidine, Dako). For MHC II staining, 10-µm cryosections were
prepared, quenched in 2% H₂O₂ in PBS for 30 min, before incubation for 18 h in 0.1%
Triton PBS with 4% milk (pH 7.4) with the primary antibody (Santa Cruz,
SC-59323, 1:500). This antibody is a mouse monoclonal antibody raised against white
blood cells originating from the chicken. To avoid cross reaction with endogenous
biotin/avidin, an HRP-conjugated secondary antibody was used (Biorad, 1:500), and
staining visualized with DAB. Sections were thoroughly washed with PBS before PB 
staining and scanning as described above. All cell counting and co-localization 
studies were performed blinded to the antibody used. The overall percentage of 
co-localization was determined by calculating the rate of co-localization per bird, 
and ascertaining the mean. 
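
The per-bird averaging described above differs from pooling all cells, as the following sketch illustrates. The counts are invented for illustration only and the function name is hypothetical; it shows the calculation, not the authors' data.

```python
from statistics import mean


def mean_colocalization(per_bird: list[tuple[int, int]]) -> float:
    """Mean of per-bird co-localization rates, in percent.

    Each tuple is (co-localized cells, PB-positive cells counted) for one bird.
    """
    rates = [100.0 * co / total for co, total in per_bird if total > 0]
    if not rates:
        raise ValueError("no bird has any counted cells")
    return mean(rates)


# Hypothetical counts for five birds.
counts = [(0, 200), (1, 400), (0, 150), (2, 500), (0, 250)]

per_bird_mean = mean_colocalization(counts)
pooled = 100.0 * sum(c for c, _ in counts) / sum(t for _, t in counts)
```

Here the per-bird mean is about 0.13%, whereas pooling every cell gives 0.2%, because birds with more counted cells weigh more heavily in the pooled figure.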
Ultrastructure studies. Adult pigeons were perfused with 2.5% glutaraldehyde
supplemented with 2% PFA in PBS (pH 7.4) (Glut-PFA), before tissue dissection 
with ceramic-coated tools. Following post-fixation for 48 h, this tissue was washed 
with PBS, incubated for 1h in phosphate-buffered 2% osmium (pH 7.4), 
dehydrated and embedded in epoxy resin. After polymerization, the blocks were 
trimmed and sectioned, alternately taking semi-thin (2 µm) and ultra-thin
(70 nm) sections. The ultra-thin sections were mounted on formvar-filmed copper 
slot grids for TEM, whereas the semi-thin sections were etched with 21% sodium 
ethoxide in ethanol, rehydrated and stained with PB and NFR. Where necessary, PB 
staining was intensified by incubating the sections in 0.5% DAB. TEM imaging on
ultra-thin sections used a 100 kV electron microscope (FEI Morgagni 268D) with a 
CCD camera (Morada Olympus-SIS). For EFTEM imaging and selected area elec- 
tron diffraction, the ultra-thin sections were mounted on holey carbon-filmed 
copper finder grids (Quantifoil, R3.5/1) and analysed on a 200 kV TEM (JEOL,
2100) fitted with a Gatan Imaging Filter (Tridiem) and CCD camera (Orius 
SC1000). For EFTEM, bright-field images were taken before obtaining elemental 
maps for iron, which were acquired using the iron M-edge and generated using the 
conventional three-window method. Two pre-edge (background) images were 
acquired at energies of 45 and 50 eV, and the post-edge (signal energy) image
was acquired by centring the filter’s energy-selecting slit at 59 eV with a slit width
of 5 eV (~10 s acquisition time). Diffraction data were obtained from iron deposits
identified by EFTEM and the data calibrated against a polycrystalline gold standard. 
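
The three-window calculation can be sketched per pixel as below. The text gives only the window energies (45, 50 and 59 eV); the power-law background model A·E^(−r) used here is the common choice for this method and is an assumption, as are the function names and the handling of non-positive counts.

```python
import math


def three_window_signal(pre1: float, pre2: float, post: float,
                        e1: float = 45.0, e2: float = 50.0, e3: float = 59.0) -> float:
    """Net edge signal at one pixel, assuming a background of the form A * E**(-r)."""
    if pre1 <= 0 or pre2 <= 0:
        return 0.0  # a power law cannot be fitted to non-positive counts
    # Fit the exponent and amplitude from the two pre-edge windows.
    r = math.log(pre1 / pre2) / math.log(e2 / e1)
    a = pre1 * e1 ** r
    # Extrapolate the background to the post-edge window and subtract it.
    background = a * e3 ** (-r)
    return post - background


def iron_map(pre1_img, pre2_img, post_img):
    """Apply the per-pixel calculation to three equally sized 2D lists."""
    return [
        [three_window_signal(p1, p2, p3) for p1, p2, p3 in zip(row1, row2, row3)]
        for row1, row2, row3 in zip(pre1_img, pre2_img, post_img)
    ]
```

A pixel whose three intensities follow the same power law yields a signal near zero, so only intensity in excess of the extrapolated background is attributed to iron.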
MRI. Animals were killed and perfused with 4% PFA as described above. 
Following 18 h of post-fixation, heads were mounted in a 70-mm diameter PE
tube filled with proton-free perfluoro-polyether fomblin (Solvay Solexis S.p.A.). 
Imaging was performed on a horizontal bore 9.4 T DirectDrive VNMRS system 
(Agilent Technologies) using a 72 mm quadrature birdcage volume coil (RAPID 
Biomedical GmbH). For three-dimensional imaging, a gradient-echo sequence 
with the following parameters was used: echo time (TE), 2.8 ms; repetition
time (TR), 280 ms; flip angle, 40°; six averages; field of view, 70 × 70 × 35 mm;
matrix size, 512 × 512 × 256. For higher-resolution images of the beak, PFA-fixed
beaks were incubated with 8 mM gadolinium solution (Magnevist, Bayer AG) in
PBS for 48 h followed by embedding in agar containing 8 mM gadolinium.
Gradient-echo images were acquired using the following parameters: TE, 2.7 ms;
TR, 25 ms; flip angle, 45°; five averages; field of view, 28 × 28 × 35 mm; matrix
size, 560 × 560 × 700. Regions of interest were segmented using thresholds with
manual adjustments where necessary using Amira visualization software (v.5.2.2, 
Visage Imaging). 
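
The nominal voxel dimensions implied by these acquisition parameters follow from dividing each field-of-view edge by the corresponding matrix dimension. The short sketch below does that arithmetic; the function name is hypothetical, and the figures are derived from the stated parameters rather than quoted from the paper.

```python
def voxel_size_mm(fov_mm, matrix):
    """Voxel edge lengths in mm: field of view divided by matrix size, per axis."""
    return tuple(f / m for f, m in zip(fov_mm, matrix))


# Whole-head scan: 70 x 70 x 35 mm field of view, 512 x 512 x 256 matrix.
whole_head = voxel_size_mm((70, 70, 35), (512, 512, 256))

# Higher-resolution beak scan: 28 x 28 x 35 mm field of view, 560 x 560 x 700 matrix.
beak = voxel_size_mm((28, 28, 35), (560, 560, 700))
```

Both scans come out isotropic: roughly 137 µm per voxel edge for the whole head and 50 µm for the beak.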

CT imaging. CT imaging was performed on a Nucline Nano SPECT/CT imaging 
system (Mediso) using the following imaging parameters: 360 projections, pitch
0.5, 55 kVp, 145 µA, acquired at 45 µm isotropic and reconstructed to 50 µm isotropic.
Unexpected features of Drosophila circadian
behavioural rhythms under natural conditions 
Circadian clocks have evolved to synchronize physiology,
metabolism and behaviour to the 24-h geophysical cycles of the
Earth. Drosophila melanogaster’s rhythmic locomotor behaviour
provides the main phenotype for the identification of higher
eukaryotic clock genes. Under laboratory light-dark cycles, flies show
enhanced activity before lights on and off signals, and these
anticipatory responses have defined the neuronal sites of the
corresponding morning (M) and evening (E) oscillators. However,
the natural environment provides much richer cycling environ- 
mental stimuli than the laboratory, so we sought to examine fly 
locomotor rhythms in the wild. Here we show that several key 
laboratory-based assumptions about circadian behaviour are not 
supported by natural observations. These include the anticipation 
of light transitions, the midday ‘siesta’, the fly’s crepuscular activity, 
its nocturnal behaviour under moonlight, and the dominance of 
light stimuli over temperature. We also observe a third major 
locomotor component in addition to M and E, which we term ‘A’ 
(afternoon). Furthermore, we show that these natural rhythm phe- 
notypes can be observed in the laboratory by using realistic temper- 
ature and light cycle simulations. Our results suggest that a 
comprehensive re-examination of circadian behaviour and its 
molecular readouts under simulated natural conditions will provide 
a more authentic interpretation of the adaptive significance of this 
important rhythmic phenotype. Such studies should also help to 
clarify the underlying molecular and neuroanatomical substrates 
of the clock under natural protocols. 

Drosophila melanogaster provides a prominent model system in 
higher eukaryotes for studying the molecular and neurogenetic basis 
of circadian behavioural rhythms. The fly’s 24-h rhythmic locomotor
activity has morning (M) and evening (E) components, interpreted in
numerous laboratory studies to reflect the fly’s anticipation of regular
changes in light-dark transitions (Fig. 1a). Neurogenetic dissection
has revealed discrete M and E neuronal clusters that determine these
circadian landmarks. We wondered how the complex naturally
cycling geophysical environment might affect these behavioural patterns,
so we investigated fly locomotor rhythms in wild-type and clock 
mutant strains under natural conditions, by placing activity monitors 
outdoors for three seasons (2007-2009), from April to November in 
Leicester, UK (latitude 52° 38’ N), and Treviso, Italy (45° 65’ N) (Sup- 
plementary Fig. 1). 

Generally, and throughout the seasons, clock mutants show very 
high levels of rhythmicity (85-100%) that reflect the cycling environment
driving (entraining) their behaviour, except Clk, which shows
decreased rhythmicity (40%) at lower temperatures (Supplementary 
Table 1). Figure 1b-d illustrates locomotor profiles of a natural wild- 
type strain (WTALA) and clock mutants from summer observations 
in Treviso. Contrary to laboratory-based results, at warmer temperatures
the ‘siesta’ (decreased activity during the hottest parts of the day;
Fig. 1a)*"° was not observed. Instead, and in addition to the well-established
M and E locomotor peaks, a prominent mid-afternoon
component, which we term ‘A’, was present in more than 50% of 
wild-type flies at mean daytime temperatures of 20 °C, rising to about 
100% at 27°C (Fig. 1c). Laboratory studies in light-dark cycles at 
constant high temperatures of 29 °C (refs 8, 9), or in rectangular light 
and temperature cycles, do not reveal A components (Fig. 1a). This
previously unobserved behaviour was seen in all genotypes at warmer 
temperatures (Fig. 1d and Supplementary Fig. 2) and may represent a 
stress/escape response that is phase-locked to the rise and fall in tem- 
perature. Alternatively, but not exclusively, A may reflect an environ- 
mentally modulated circadian phenotype. We observed that the A 
component was significantly advanced by up to 3h in short-period 
per® mutants in comparison with other per genotypes; per”’ was also 
advanced compared with per” and the wild type, and the A offset was 
significantly delayed in per’ compared with the other genotypes 
(Fig. 1e and Supplementary Fig. 2), suggesting that A is modulated
by the clock and is not a simple escape reflex. 

Under laboratory 12 h light-12 h dark cycles, wild-type flies anticip- 
ate lights-on by increasing their locomotor activity about 2 h before the 
lights-on signal, in contrast with arrhythmic mutants such as per”’, 
which do not anticipate the signal®’. Consequently, in nature, if wild-type
flies anticipate a dawn-related geophysical transition, clock
mutants would be expected to ‘not anticipate’ and delay their activity 
accordingly. In nature, astronomical, nautical and civil twilights, 
defined by the angles of the Sun below the horizon (18°, 12° and 6°, 
reflecting illumination of approximately 0.001, 0.01 and about 1 lx, 
respectively), accompany the dawn-dusk transitions'’. We therefore 
used nautical twilight, which occurs roughly 1.5-2 h before dawn, as a 
convenient environmental marker by which to compare the onset of 
morning activity (Monset) among genotypes. Seasonally, the average 
daily Monset for WTALA flies in Leicester and Treviso reveals that the 
two locations differ in their overall profiles (Supplementary Fig. 3a). 
However, when the values are replotted against average night temper- 
ature, this difference evaporates, revealing a similar inverse relation- 
ship between Monset and temperature in three wild-type strains 
(Fig. 2a-c), clock mutants and Gal4-driven genotypes (Fig. 2d-j and 
Supplementary Fig. 4). The exception is tim”, for which Monset is only 
marginally temperature-dependent (Fig. 2d) and is rescued by trans- 
formation with the tim* gene, and in per stim”! double mutants 
(Fig. 2e, f). Ablating the Pigment Dispersing Factor (PDF)-expressing 
(Morning) neurons with UAS-hidUAS-rpr, or using the UAS-CycA 
dominant-negative to stop the clock in these cells, failed to disrupt 
the relationship between Monset and temperature (Supplementary
Fig. 4). 
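
The inverse relationship between Monset and night temperature described above can be illustrated with an ordinary least-squares fit. The sketch below is only a minimal illustration: the function name and the example values are invented for demonstration and are not data or code from this study, which used ANCOVA for the genotype comparisons.

```python
def least_squares(x, y):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical values: mean night temperature (deg C) and Monset in hours
# relative to nautical twilight (positive = later than twilight).
night_temp = [10, 15, 20, 25]
m_onset = [1.0, 0.5, 0.0, -0.5]

slope, intercept = least_squares(night_temp, m_onset)
# A negative slope means morning activity starts earlier on warmer nights.
print(f"slope = {slope:.2f} h per deg C, intercept = {intercept:.2f} h")
```

A negative slope corresponds to the inverse relationship reported in the text; comparing slopes between genotypes would need the covariance analysis described by the authors.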

We investigated whether clock mutants ‘anticipate’ changes in twilight 
differently from the wild type, mirroring laboratory studies®’. We used 
an analysis of covariance (ANCOVA; temperature as covariate) to 
Figure 1 | The afternoon (A) component of locomotor activity. a, An
individual male’s (WTALA) locomotor activity in laboratory conditions with 16 h 
light-8 h dark and 22 °C in the dark phase and 30 °C in the light. M, morning; S, 
siesta; E, evening (yellow, light intensity; red, temperature). b, An individual 
male’s (WTALA) locomotor activity on a single summer’s day (yellow, light 
intensity; red, temperature; day temperatures: average 28.7 °C, maximum 34.4 °C, 
minimum 23.4 °C). M, morning; A, afternoon; E, evening. Time 0 = midnight 
(00:00). c, Afternoon peak is associated with temperature (R² = 0.34, F1,35 = 17.7;
P < 0.0002). d, Averaged locomotor plots (means ± s.e.m.) for WTALA and four
arrhythmic mutants (see Supplementary Fig. 2 for other mutants). WTALA 

(n = 19), per®’ (n = 12) and Clk" (n = 27, with 3 arrhythmic flies) were 
monitored together (average temperature over experiment 28.0 °C, maximum 
34.4 °C, minimum 21.1 °C), whereas tim”! (n = 17) and the double mutant 
per”';tim” (n = 12) were monitored at a different time (average temperature 
29.3 °C, maximum 35.9 °C, minimum 23.1 °C). Red, temperature; yellow, light 
intensity. e, Afternoon component in per mutants compared with wild-type ALA 
(WTALA). Times of Aonset (green squares), Apeak (red circles) and Aoffset (blue
triangles) of WTALA were subtracted from the corresponding values for the 
mutants because not all genotypes were run in the same experiment. Negative 
values indicate an earlier phenotype than WTALA. ANOVAs for all parameters 
gave significant genotype effects (F215; = 4.8 (onset), 8.9 (peak) and 29.8 (offset); 
all P< 0.0.01). In post hoc tests, per’ was significantly earlier in all parameters than 
per", and significantly earlier than per” in peak and offset. per” was significantly 
delayed compared with per” in all parameters. 


inspect the average time from nautical twilight that each clock variant 
began morning activity, but only Pdf°, which was slightly delayed, 
fell significantly outside the range of the three wild types (Fig. 2i and
Supplementary Table 2a). These results suggest that clock genotypes 
(except tim°) generally anticipate geophysical variables equally. 
Alternatively, if Monset activities are simply temperature-sensitive 
responses to dim light levels, they should be delayed on the following 
morning if the activity monitors are covered up after the previous 
sunset. When we performed this experiment, significant delays of 
about 1h were observed for all genotypes (except tim”), with Monset 
now tracking the morning rise in temperature (Fig. 2k, l, Supplementary
Fig. 5 and Supplementary Table 2b). The attenuated
tim” Monset twilight-modulated temperature responses (Fig. 2d and 
Supplementary Fig. 5a) recall the mutant’s disrupted diapause, a 
photoperiodic phenotype also requiring both light and temperature 
input’. We conclude that Monset is a twilight-dependent temperature 
response with little evidence for circadian regulation. 

We next examined evening onset (Eonset), which is seasonally
delayed relative to the photoperiod during the warmer summer 
months (Supplementary Fig. 3b). We replotted Eonset against average 
daytime temperature, relative to the time of maximum daily temperature
(Tmax; Fig. 3 and Supplementary Fig. 6). All genotypes showed
Eonset activities that began successively later as mean daily tempera- 
tures rose above 20 °C (Fig. 3 and Supplementary Fig. 6). At less than 
20 °C, the response was stable in relation to Tmax, with marginally
significant negative relationships for WTALA, HU and Clk” strains 
(Supplementary Table 3a) and with Eonset generally occurring within 
the 2.25 ± 0.17 h (mean ± s.e.m.) that defined the time between the
maximum daily light intensity and the subsequent Tmax (Fig. 3a-i).
When analysed separately for temperatures below and above 20 °C, 
per® showed the earliest Eonset activities, followed by per” and tim’; 
per’ had the latest (Supplementary Table 3b, c). This is reflected in 
Fig. 3, where the data points fall either mostly above (later, per”) or 
below (earlier, per®, per’, tim”) the upper dotted line representing 
Tmax. These results mirror laboratory findings in which per® and per
modulate the timing of evening behaviour’; in addition, however, the 
unexpected observation of E components in arrhythmic mutants, and 
their phase advance, indicates an underlying residual short-period 
rhythmicity"'*"°. 

In laboratory wild-type flies, the length of the temperature-sensitive 
afternoon ‘siesta’ is mediated by the thermal regulation of per 3’ splicing*’°.
Eonset defines the end of the siesta, so we studied whether the per
splicing readout correlated with the Eonset curve. Levels of per 3’ splic- 
ing in fly heads throughout the seasons for WTALA flies (Fig. 3j) were 
linearly related to temperature over the entire range 7-30 °C, so it is 
unlikely that splicing contributed significantly to Eonset at temperatures
below 20 °C (Supplementary Fig. 6F). Indeed, because the laboratory
siesta is mediated by the dynamics of the upswing of the clock protein 
PERIOD (PER)*, the fact that per®'-null mutants showed a similar 
Eonset-temperature relationship to that of the wild type also argues 
against a significant role for per 3’ splicing in this natural phenotype. 
A similar conclusion using splice-locked per transformants was 
reached in laboratory experiments using artificial temperature and 
light cycles’”. 

Drosophila activity rhythms in the laboratory are often described as 
‘crepuscular’ because of the distribution of M and E components at 
dawn and dusk'*. We therefore quantified the activity falling in the 
morning and evening twilights (astronomical, nautical and civil) as a 
proportion of all activity falling between the beginning of morning and 
the end of evening astronomical twilights. Using data from Italian 
experiments, we observed that even at the warmest temperatures when 
Monset and Eonset are pushed towards the twilights (Figs 1-3), the
proportion of activity falling within the twilights was less than 25% 
and was reduced still further at cooler temperatures (Fig. 3k). Thus, in 
nature the major proportion of activity, even at high temperatures, falls 
outside the twilights, so flies are diurnal rather than crepuscular. 
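
As a rough illustration of this calculation, the sketch below computes the share of activity that falls within the morning and evening twilight windows, relative to all activity between the start of morning astronomical twilight and the end of evening astronomical twilight. The function name, the assignment of each bin by its start time and the example times are assumptions made for illustration; they are not taken from the analysis used in this study.

```python
def crepuscular_fraction(counts, bin_hours, am_start, sunrise, sunset, pm_end):
    """Fraction of activity that falls within the twilights.

    counts    : activity counts per bin; bin i starts at i * bin_hours (h)
    am_start  : start of morning astronomical twilight (h)
    sunrise   : end of morning civil twilight (h)
    sunset    : start of evening civil twilight (h)
    pm_end    : end of evening astronomical twilight (h)
    Each bin is assigned to a window by its start time (an assumption).
    """
    twilight = 0
    total = 0
    for i, count in enumerate(counts):
        t = i * bin_hours
        if am_start <= t < pm_end:
            total += count
            if t < sunrise or t >= sunset:
                twilight += count
    return twilight / total if total else 0.0


# Hypothetical day in 48 half-hour bins: twilights at 03:00-05:00 and 21:00-23:00.
counts = [0] * 48
counts[6:10] = [5] * 4      # morning twilight bins
counts[10:42] = [10] * 32   # daytime bins
counts[42:46] = [5] * 4     # evening twilight bins
print(crepuscular_fraction(counts, 0.5, 3.0, 5.0, 21.0, 23.0))
```

With these invented counts, 40 of 360 counts fall in the twilights, so the fraction is about 0.11.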

Related laboratory experiments have also suggested that locomotor 
activity becomes predominantly crepuscular and nocturnal under 


©2012 Macmillan Publishers Limited. All rights reserved 


[Figure 2 image: x axis, night temperature (°C)]


Figure 2 | The morning onset (Monset) of locomotor activity is dependent on 
temperature and twilight. a-j, Time of Monset in a single day compared with 
nautical twilight (dotted line, time 0). Blue and yellow areas represent the 
approximate time between astronomical and nautical twilight and between 
nautical twilight and sunrise, respectively (both calculated as seasonal averages 
for Treviso). Each point represents the mean of 4-24 flies (median 14); error 
bars represent s.e.m. Blue squares, Leicester; red circles, Treviso. a-c, Wild 


simulated moonlight and this correlates with delayed expression of the 
clock proteins PER and TIMELESS (TIM) in the fifth PDF-negative 
sLNv neuron’””®. We calculated the proportion of WTALA nocturnal 
activity during new moons and immediately before and after full 
moons. Figure 4a reveals no significant differences in nocturnal activity 
between the two conditions, even when we varied the exposure to 
moonlight from 0.03 to 0.33 lx (Supplementary Fig. 7). When we analysed
PER in adult brains by immunocytochemistry under conditions
of full moon and no moon, consistent with the behaviour, PER expres- 
sion in the fifth sLNv (Fig. 4b) was not significantly delayed in moon- 
light in comparison with the other lateral clock neurons (s-LNvs, 
l-LNvs and LNds). The dorsal DN1 and DN2 neurons showed significantly
earlier PER peaks than the other neurons (Fig. 4b and
Supplementary Fig. 8), indicative of a faster underlying oscillation”. 
These results suggest that natural temperature and light cycles may 
counteract the delayed PER expression of the fifth sLNv observed 
under simulated moonlight. 

Finally, we attempted to reproduce natural circadian phenotypes 
within the laboratory by simulating cycling temperature and light 
intensity. We simulated consecutive days of new moon (darkness at 
night), with astronomical twilight (0.005 lx) giving way gradually to 


types: WTALA (Alto Adige) (a), Canton S (b) and HU (Houten) (c). d-j, Clock 
mutants: tim”! (d), tim”!;P[tim*] (e), per” ;tim®! (f), per"! (g), Clk" (h), Pdf? 
(i) and cry”’ (j) (Supplementary Fig. 4). The slope for tim”’ is significantly 
different from that for all three wild types (Supplementary Table 2a). k, 1, Monset 
of wild type (k) and per” mutant (1) before (red) and after (black) activity 
monitors were covered at sunset and maintained until mid-afternoon 
(Supplementary Fig. 5 and Supplementary Table 2b). 


nautical twilight (0.05 lx), to civil twilight (1 lx) over a 2-h period at the
beginning and (in reverse) at the end of the day, rising to a maximum 
intensity of 1,500 lx in the afternoon. We ran this at cycling day-night 
temperatures ranging from 20 to 30°C (spring) or from 25 to 35 °C 
(summer), mimicking actual temperatures that we measured in two 
Treviso spring and summer days, respectively, when the photoperiod 
for both was roughly 16h light-8h dark. We observed that in the 
summer simulation, M, A and E components were prominently dis- 
played by all genotypes (WTALA, per mutants, tim®’ and Clk), 
although Clk”* showed a smaller E burst on most days (Fig. 4c and 
Supplementary Fig. 9). We confirmed that Monset occurs between the 
simulated times of astronomical and nautical twilight, as in the wild at 
these temperatures, with all genotypes becoming active 0.2-0.7h 
before simulated summer nautical twilight, but later (between 0.1 
and 0.4h) before simulated spring twilight (Supplementary Fig. 9c). 
As in the wild, Monset is temperature-sensitive, with no significant 
differences between the wild type and either per” or tim°’ mutants 
in the timing of Monset (although per® and pes were slightly earlier in
their Monset for both spring and summer simulations), and with tim”! 
showing a similar temperature independence to that in the wild 
(Supplementary Fig. 9c). We observed the circadian modulation of 
Figure 3 | Evening onset is temperature and clock modulated. a-i, Evening
onset compared with daily maximum temperature (Tmax, 0). Dotted line below
Tmax represents the average time that light peaked (about 2.25 h) before Tmax. 
Blue squares, Leicester; red circles, Treviso. a-c, Wild types: WTALA (Alto 
Adige) (a), Canton S (b) and HU (Houten) (c). d-i, Clock mutants: per”! 

(d), per” (e), per (f), tim”! (g), per” stim” (h) and Clk (i) (see also 
Supplementary Fig. 6 and Supplementary Table 3). Most data points below 
20 °C fall between the times of the light and temperature peaks. per”, tim’! and 
pers have earlier Eonset (ANCOVA: P ~ 0, 0.003 and 0.0007, respectively, versus 
WTALA) as shown by the lower scores on the y axis; per” has later onset

(P = 0.0003 versus wild-type). j, per splicing in nature in fly heads. per spliced
isoform compared with total per (per spliced + per unspliced) against average daily
temperature in WTALA strain. Heads collected at 00:00 (black squares), 03:00 
(red circles), 12:00 (blue triangles) and 15:00 (green triangles). k, Locomotor 
activity is neither crepuscular in natural conditions nor in simulated natural 
conditions. Red circles, mean ± s.e.m. of daily crepuscular activity in natural
condition of groups of WTALA flies in Italy from single multi-day experiments; 
black squares, Italian spring and summer laboratory simulations (see Fig. 4c). 


the E component in our simulations (Supplementary Fig. 9e) and 
whereas nearly all flies, irrespective of clock genotype, showed a prom- 
inent A component in the summer simulation, this proportion fell to 
20-50% in the spring simulation, matching our results from nature and 
revealing the importance of temperature for the expression of A 
(Fig. 4c and Supplementary Fig. 9b, d). We also observed similar pro- 
portions of crepuscular activity for the spring and summer simulations 
to those in our natural observations (Fig. 3k). We did not, however, 
reproduce the advance in the A component of per® compared with the 
wild type that we observed in nature (Supplementary Fig. 9d). 

Our results from the wild reveal a diminished role for ‘anticipation’ 
of light-dark transitions partly because arrhythmic mutants also show 
M and E components, and also because the M component appears to 
be a response to changes in the environment. However, the newly 
Figure 4 | Diurnal phenotypes of D. melanogaster. a, The proportion of
nocturnal locomotor activity of WTALA males does not differ between days of
full moon (open circles) and no moon (filled circles) (ANCOVA: F1,32 = 0.04,
P = 0.84). Results are means ± s.e.m. and are based on 4-24 flies (median 17).
b, PER immunofluorescence intensity (mean ± s.e.m., arbitrary units) in clock
neurons in May 2008 (left) and September 2008 (right) (new and full moon,
respectively, corresponding to filled and open circles in a). Red, fifth Pdf-null
sLNv; brown, sLNvs; green, LNds; turquoise, l-LNvs. DN1 neurons (black) and
DN2 neurons (purple) are significantly advanced (F5,493 = 13.71, P ~ 0; see also
Supplementary Fig. 8). There is a significant advance for all neurons in
September in comparison with May (F1,493 = 34.1, P ~ 0) but no
month × neuron interaction (F5,493 = 0.79, P = 0.56). May, lights on at 05:16,
lights off at 20:59, photoperiod 15 h 43 min; September, lights on at 06:20, lights 
off at 19:50, photoperiod 13 h 30 min. c, Simulation of Italian summer in the 
laboratory for the wild type and for arrhythmic mutants (16h light-8 h dark, 
temperature range 25-35 °C; see also Supplementary Fig. 9). n = 19-32 for 
each genotype at each temperature. 


described A and the E components are clock-modulated, unlike M. 
This may be because M occurs during the rapid changes in illumina- 
tion from quasi-darkness to bright light, when temperature is constant 
at its daily minimum, whereas A and E components occur when the 
light intensity is peaking or falling rapidly but temperature is either still 
rising or falling gradually. Such conflicting environmental signals may 
have recruited the clock to regulate appropriately adaptive A and E 
locomotor responses. Our results also show that temperature is the 
critical variable for predicting circadian behaviour’’, because the large 
differences in photoperiod between our two locations (more than 3h 
in midsummer) contribute little to the behavioural variance. In con- 
trast, laboratory studies that place temperature and light cycles in 
different phases to each other” reveal that light is the most important 
driver of circadian behaviour. 

Although lateral and dorsal circadian neurons underlying M and E 
behaviour in light-dark cycles at constant temperature have been 
identified®’, subsets of dorsal neurons and lateral posterior neurons 
may be relevant for circadian temperature signalling’ *°. Perhaps in 
the wild it is these neurons that dominate the neuronal circuit. 
However, our observation that various ‘clockless’ genotypes can 
nevertheless show M and E (and A) components also suggests that 
the underlying neuronal substrates for this behavioural programme 
may not reside solely in the lateral and dorsal neurons but also 
elsewhere in the brain. Clock gene expression enhances and modulates 
circadian behaviour in the wild, but the possibility remains that a 
residual short-period rhythmicity resides within per and tim ‘arrhyth- 
mic’ mutants*’*”* that can be amplified under natural conditions. Our 
results resonate with studies of laboratory mice”” and hamsters”, 
which show them to behave quite differently in nature, where they are
predominantly diurnal and not nocturnal as in the laboratory. 
Similarly, generally accepted adaptive and mechanistic explanations 
for fly circadian behaviour from laboratory experiments may require 
some revision if they are to account for rhythmicity in nature. 


METHODS SUMMARY 

Fly strains. Strains WTALA and HU are natural isolates collected as isofemale 
lines from northern Italy and The Netherlands’*. Canton-S is a standard laboratory 
strain. per”’, per® and per” are congenic and Cantonized, but the genetic back- 
grounds of the other mutants used per” ;tim”, Clk, Pdf a cry? and Gal4-driven 
transgenes (drivers tim-gal4 and Pdf-gal4), the apoptotic UAS-hidUAS-rpr and the 
dominant-negative UAS-CycA (ref. 29) are not equivalent, and for most are 
unknown. 

Behaviour. Flies were raised at 25 °C in 12 h light-12 h dark cycles in the laboratory 
before males were placed outside at 2-3 days of age in Trikinetics activity monitors. 
Morning (Monset) and afternoon or evening (Aonset, Eonset) onsets were determined
operationally for each fly on each day of every experiment. Circadian rhythmicity 
was assessed by spectral analysis and autocorrelation as reported previously*”. 
Moonlight measurements. Measurements of moonlight were made in Treviso 
and Leicester, using a LI-210SA Photometric Sensor (LI-COR) connected to a 
LI 1400 data logger. 

Twilights. Twilights were obtained from the online database of the United States 
Naval Observatory (USNO) Astronomy Application Department”’. 
Immunocytochemistry. The slides were examined under a Nikon 80i microscope 
with QICAM Fast Camera. Individual images were taken of planes at different 
depths to create a z-series for each brain. Brains were stained with primary antibodies 
against PDF and PER (the latter was a gift from R. Stanewsky). 

RNA quantification. Flies were collected every 3h for a 24-h time course in 
natural conditions, and fixed in liquid nitrogen. Heads were removed and the per spliced and
per unspliced forms and the control gene Cbp20 (Cap binding protein 20) were amplified
by PCR. PCR products were revealed on a 2% agarose gel and imaged, and
final quantification was obtained with ImageJ software. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Fly strains. Strains WTALA and HU are natural isolates collected as isofemale 
lines from northern Italy (Alto Adige 46° 30’ N) in 2004 and from Houten, The
Netherlands (52° 2’ N) in 2004, respectively. Canton-S is a standard laboratory 
strain. per”, per® and per” are congenic and were backcrossed repeatedly to a 
Cantonized per deficiency Df(1)64j4 background by C.P.K. in the early 1980s. 
The genetic backgrounds of the other fly strains used are varied, and for most 
they are unknown. 

In the laboratory, even at a constant 10 °C, we observed that UAS-CycA (ref. 29) 

was effectively driven by the timgal4 driver and gave an arrhythmic locomotor 
profile compared to the control strain (Supplementary Fig. 10). All flies were raised 
to adulthood at 25 °C in the laboratory before being placed outside at 2-3 days of 
age. We confirmed that UAS-hidUAS-rpr was effective in eliminating the clock 
neurons under tim- and Pdf-gal4 control using immunocytochemistry (data not 
shown). 
Locomotor activity. Locomotor activity was monitored in suburban gardens well 
away from street lighting and from any major roads. Flies were placed in 
Trikinetics activity monitors, and shielded from direct sunlight and from rain 
(Supplementary Fig. 1). In Leicester this was accomplished by placing the monitors 
on a table within a child’s plastic playhouse (Toys R Us) with approximate internal
dimensions of 4 × 4 × 4 ft, with open windows in a north-south direction that
was shaded by an adjacent large tree. Bricks placed on the roof ensured that the 
house did not blow away during high winds. Cables from monitors were fed from 
the back of the playhouse into a computer that was placed inside an immediately 
adjacent extension to C.P.K.’s house. In Italy, monitors were placed under a 
penthouse roof in a garden roughly 4km from the city centre of Treviso, again 
with no direct sunlight, with cables fed to a computer within the penthouse. Flies 
were monitored from March to November (Treviso) or from May to September 
(Leicester). Simultaneous and continuous monitoring of temperature, light 
intensity and humidity were made with recorders placed adjacent to the activity 
tubes. Circadian rhythmicity was assessed by spectral analysis and autocorrelation 
as reported previously”’. 

Morning (Monset) and afternoon or evening (Aonset, Eonset) onsets were determined
for each fly on each day of every experiment (see Supplementary Fig. 11 for
examples). Monset was considered to be present when a bout of activity occurred
after a period of rest during the dark phase, if continuous movement with a steady 
increase in activity (and no more than one half-hour time bin without any or lower 
activity than the previous bin) led to a peak, followed by a steady decrease in 
activity defining the offset. Activity bouts in the middle of the night (not leading 
continuously into the morning) were not considered. In general, the window of 
time that contained the morning bout was 3h before or after the 30-min time bin 
during which the light intensity reached 1 lx. If there was no bout of activity
consistent with the parameters described above, it was noted that there was no 
morning component for that fly on that day. 

Eonset was considered to be present when a bout of activity occurred after a
period of rest during the day. If a burst of activity occurred in the middle of the 
afternoon on warmer days, with another clear bout after it, the latter was con- 
sidered to represent the Eonset, whereas the former represented the afternoon 
component. The evening and afternoon bouts of activity had to be composed of 
continuous movement with no more than one zero activity bin interspersed 
within, and with a steady increase and decrease in activity levels defining the 
onset/offset. If there was no bout of activity consistent with the parameters 
described above, it was noted that there was no evening peak for that fly on that 
day. 
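
The operational criteria above can be approximated in code. The following is a simplified sketch, not the procedure actually used for scoring: it finds the first bout within a search window that follows a rest bin and tolerates at most one empty half-hour bin inside the bout, but it does not check that activity rises and falls steadily around the peak. The function name and the example counts are hypothetical.

```python
def find_bout(counts, start, stop, max_gaps=1):
    """Return (onset, peak, offset) bin indices of the first bout in
    counts[start:stop], or None if no bout is found.

    A bout begins at an active bin that follows a rest (zero) bin and
    continues while activity persists, tolerating at most `max_gaps`
    isolated zero bins inside the bout.
    """
    i = start
    while i < stop:
        rested = i > 0 and counts[i - 1] == 0
        if counts[i] > 0 and rested:
            end, gaps, j = i, 0, i + 1
            while j < stop:
                if counts[j] > 0:
                    end = j
                elif gaps < max_gaps and j + 1 < stop and counts[j + 1] > 0:
                    gaps += 1  # single empty bin inside the bout
                else:
                    break
                j += 1
            peak = max(range(i, end + 1), key=lambda k: counts[k])
            return i, peak, end
        i += 1
    return None


# Hypothetical half-hour counts around dawn for one fly.
counts = [0, 0, 3, 5, 0, 9, 4, 1, 0, 0, 2]
print(find_bout(counts, 0, len(counts)))  # (2, 5, 7)
```

In this invented example the bout starts in bin 2, peaks in bin 5 and ends in bin 7; the single empty bin 4 is tolerated, whereas the two empty bins 8 and 9 end the bout.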

Moonlight measurements. Measurements of moonlight were made in Treviso 
and Leicester, using a LI-210SA Photometric Sensor (LI-COR) connected to a 
LI 1400 data logger. Five-minute readings were made four times during selected 
nights of full moon or new moon, and the minimum and maximum light levels 
were recorded, both in fully exposed full moonlight conditions and in the more 
sheltered positions in which the Trikinetics monitors were normally maintained. 
Average maximum nightly full moonlight levels were 0.23 ± 0.15 lx in fully
exposed positions, in comparison with 0.07 ± 0.01 lx in sheltered conditions in
Leicester (30 November 2009). The corresponding values in Treviso were
0.33 ± 0.07 lx in exposed positions and 0.03 ± 0.01 lx in sheltered conditions. At
new moon, in both Leicester and Treviso, the light values were 0 lx.

Twilights. Twilights were obtained from the online database of the United States 
Naval Observatory (USNO) Astronomy Application Department". 


Natural temperature profiles. These were reproduced in the laboratory by using 
the Memmert ICP 700 incubator with a refrigeration unit. The Celsius 2007 
computer program, running under the Windows XP operating system, was used 
for programming, controlling and documenting the Memmert ICP 700 using the 
USB interface. The incubator can generate smooth cycling temperature ranges 
without step-ups. 

Natural light profiles. These were simulated by using a programmable daylight 
simulator based on a battery of white-light LEDs of different intensities that could be 
programmed to generate smooth cycling profiles (without step-ups) of light ranging 
from 0.005 lx (astronomical twilight) to 1,500 lx. The front-end programming
allowed the simulation of any observed natural cycle of light intensity. The equip- 
ment was designed and built by EURITMI, a spin-out from the Electronic 
Engineering Unit at the Venetian Institute of Molecular Medicine (Padova, Italy). 
Immunocytochemistry. Flies were collected every 3h for a 24-h time course in 
natural conditions and fixed for 3 h in 4% paraformaldehyde. After three washes 
(15 min each) in PBS, the brains were dissected under a stereomicroscope (Leica 
MB6). Brains were washed five times for 6 min each in 0.3% PBST (PBS + 0.3% 
Triton X-100). They were then permeabilized for 10 min in 1% PBST and blocked 
for 2h with 1% BSA. After blocking, the brains were incubated for 3 days at 4 °C, in 
the primary antibody diluted in 0.1% BSA in 0.3% PBST. They were then washed 
five times for 6 min each in 0.3% PBST. After washing, they were blocked again for 
1h with 1% BSA. The brains were then incubated overnight at 4 °C in the appro- 
priate secondary antibody. The antibodies used for the immunocytochemistry 
experiments were anti-PDF (Developmental Studies Hybridoma Bank, dilution 
of 1:5,000), anti-PER (from R. Stanewsky; 1:2,500), and anti-mouse and anti- 
rabbit (both from Invitrogen; 1:500). 

Brains were mounted onto slides (VWR) with a drop of mounting medium 
(Vectashield; Vecta Laboratories, Inc.) and covered with a coverslip 0.1 mm thick 
(VWR). The slides were observed under a Nikon 80i microscope with a QICAM 
Fast Camera. Individual images were taken of planes at different depths to create a 
z-series for each brain. The size of the sections forming a z-series was 1.0 ± 0.2 µm.
The images were viewed and quantified with ImageJ version 1.42g
(http://rsb.info.nih.gov/ij/). The average pixel intensity for each neuron was measured together
with the signal from its corresponding background area. The final amount of 
signal was calculated using the formula intensity = 100 × (signal − background)/background.
The values of intensity obtained were normalized using
the higher value as 100% and then plotted with Microsoft Excel 2003 or 
OriginPro 8.0, and statistical analyses were performed with Statistica 8 (StatSoft). 
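
The intensity formula and the normalization step can be written out as a short sketch. The function names and the example pixel values below are assumptions for illustration only; the measurements themselves were made in ImageJ, and 'the higher value' is read here as the highest value in the series.

```python
def relative_intensity(signal, background):
    """intensity = 100 * (signal - background) / background"""
    return 100.0 * (signal - background) / background


def normalize_to_max(values):
    """Rescale a series so that its highest value becomes 100%.

    Assumes the highest value is positive.
    """
    top = max(values)
    return [100.0 * v / top for v in values]


# Hypothetical mean pixel values (signal, background) at three time points.
measurements = [(150.0, 100.0), (120.0, 100.0), (200.0, 100.0)]
intensities = [relative_intensity(s, b) for s, b in measurements]
print(intensities)                    # [50.0, 20.0, 100.0]
print(normalize_to_max(intensities))  # [50.0, 20.0, 100.0]
```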
RNA quantification. Flies were collected every 3h for a 24-h time course in 
natural conditions and fixed in liquid nitrogen. Heads were removed and total 
RNA was recovered and extracted using Trizol Reagent (Gibco) as recommended 
by the kit protocol. Reverse transcription was initiated on total RNA, after any 
contaminating genomic DNA was removed with DNase (Promega), using the 
SuperscriptII (Invitrogen) reverse transcriptase with the 17-bases oligo(dT). The 
reaction was performed for 1 h at 42 °C and for 15 min at 75 °C. The per spliced and per unspliced
forms and the control gene Cbp20 (Cap binding protein 20) were amplified by PCR 
with a PTC-100 Peltier Thermal Cycler (MJ Research) using primers listed below. 
PCR was performed as follows: initial denaturation at 95°C for 3 min, then 28 
cycles consisting of 95°C for 1 min, 62.1°C for 1 min, and 72°C for 45s. The 
reaction was completed by an elongation step of 10 min at 72 °C. Amplifications 
were carried out in 20-µl reaction mixtures containing 75 ng of cDNA target, 1 µl
of each primer (10 µM), 1.6 µl of dNTPs (2 mM), 4 µl of Green Buffer (5×) and
0.4 µl of GoTaq DNA polymerase (Promega). Primers used were as follows: per,
5'-AAGACGGAGCCGGGCTCCAG-3’ (forward; base pairs 6421-6440; NCBI 
X03636) and 5'-TCTACATTATCCTCGGCTTGC-3’ (reverse; base pairs 7201- 
7221; NCBI X03636); cbp20, 5'-GTCTGATTCGTGTGGACTGG-3’ (forward; 
base pairs 540-559; FlyBase ID FBgn0022943) and 5'-CAACAGTTTGCCATA 
ACCCC-3’ (reverse; base pairs 653-672; FlyBase ID FBgn0022943). 

PCR products were revealed on a 2% agarose (Eurobio) gel under ultraviolet 
radiation. Images were collected with Quantity One 4.6 (Bio-Rad). The final
quantification was obtained with ImageJ software by applying the formula
intensityRN (pixel) = band intensity of per spliced or per unspliced/band intensity 
of cbp20. Each band’s intensity was corrected for background noise. For each 
collection, three independent RNA extractions, reverse transcriptions, amplifications
and quantifications were performed. The results were normalized and analysed
with Statistica 8 (StatSoft). The plots were obtained with OriginPro 8.0
software.
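
The band quantification can be illustrated with a short Python sketch. It assumes that background correction means subtracting a single background value from each band, which the text does not specify. The function name and the band intensities are hypothetical.

```python
from statistics import mean


def relative_expression(per_band, cbp20_band, background):
    """Ratio of a background-corrected per band to the cbp20 reference band.

    Assumption: correction is a simple subtraction of one background value.
    """
    per_corrected = per_band - background
    cbp20_corrected = cbp20_band - background
    if cbp20_corrected <= 0:
        raise ValueError("reference band must exceed background")
    return per_corrected / cbp20_corrected


# Hypothetical (per band, cbp20 band, background) values for the three
# independent replicates of one collection time point.
replicates = [(130.0, 210.0, 10.0), (100.0, 190.0, 10.0), (130.0, 250.0, 10.0)]

ratios = [relative_expression(*r) for r in replicates]
average = mean(ratios)

print([round(r, 3) for r in ratios])  # [0.6, 0.5, 0.5]
print(round(average, 3))              # 0.533
```

Dividing by cbp20 corrects for lane-to-lane differences in the amount of cDNA loaded, so the spliced and unspliced values can be compared across time points.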
Local generation of glia is a major astrocyte source in postnatal cortex


Woo-Ping Ge, Atsushi Miyawaki, Fred H. Gage, Yuh Nung Jan & Lily Yeh Jan


Glial cells constitute nearly 50% of the cells in the human brain’. 
Astrocytes, which make up the largest glial population, are crucial to 
the regulation of synaptic connectivity during postnatal develop- 
ment*. Because defects in astrocyte generation are associated with 
severe neurological disorders such as brain tumours’, it is important 
to understand how astrocytes are produced. Astrocytes reportedly 
arise from two sources**: radial glia in the ventricular zone and 
progenitors in the subventricular zone, with the contribution from 
each region shifting with time. During the first three weeks of 
postnatal development, the glial cell population, which contains 
predominantly astrocytes, expands 6-8-fold in the rodent brain’. 
Little is known about the mechanisms underlying this expansion. 
Here we show that a major source of glia in the postnatal cortex in 
mice is the local proliferation of differentiated astrocytes. Unlike glial
progenitors in the subventricular zone, differentiated astrocytes 
undergo symmetric division, and their progeny integrate functionally 
into the existing glial network as mature astrocytes that form endfeet 
with blood vessels, couple electrically to neighbouring astrocytes, 
and take up glutamate after neuronal activity. 

Most radial glia have finished producing their share of astrocytes 
and have begun to disappear shortly after birth**; astrocytes are 
therefore thought to derive mainly from progenitors in the subventri- 
cular zone (SVZ) at later stages*. The massive expansion of glia within 
the first three postnatal weeks presents a daunting task for their pre- 
sumed SVZ progenitors. This task is rendered even more challenging 
by the thickening of the cortex compounded by the disappearance of 
radial glia, which provides the migratory tracks for newly formed 
astrocytes’. We used electroporation to transfect green fluorescent 
protein (GFP) plasmids into SVZ/radial glial cells of mice at postnatal 
days (P)0-2 to label them with GFP in vivo and to trace their progeny at 
P16–20 (Fig. 1a and Supplementary Fig. 1). Only a very small percentage
(about 3%) of the astrocytes derived postnatally from SVZ/radial
glial cells reached cortical layers I-IV; most were left behind in SVZ/ 
white matter (75%) and layers V-VI (22%) (Fig. 1b, c). It therefore 
seems that huge numbers of cortical astrocytes generated postnatally 
might arise from a more efficient process, such as local cell proliferation
(Ki67⁺; Fig. 1d), rather than from SVZ progenitors. Whereas glial
cell division within the cortex was reported half a century ago, on the
basis of studies involving [³H]thymidine incorporation into DNA",
the extent of the contribution of local glial division to postnatal astro- 
cyte production remained unknown, owing to the difficulty in distin- 
guishing glia generated locally from glia derived from other sources. In 
this study, we have obtained evidence to support the hypothesis that 
the local generation of astrocytes within the postnatal cortex is a major 
source of glia. 

To label locally generated glia, we used a replication-defective murine 
leukaemia retrovirus (MLV) to express GFP in infected dividing cells 
and their progeny in postnatal cortex in vivo. This retrovirus specifically 
infects proliferating cells and has been used for cell-fate tracing in SVZ 
and the hippocampal subgranular zone (SGZ) in vivo*"'. We injected 
Figure 1 | Locally generated glia as a major source of astrocytes.
a, Procedure to label SVZ/radial glia-derived astrocytes by electroporation.
b, The distribution of astrocytes (arrows) 2 weeks after electroporation. VZ,
ventricular zone. c, Percentages of astrocytes at different locations. WM, white
matter. d, Proliferating cells (Ki67⁺, red) in a cortical section of a P3 mouse.
Nuclei were stained with 4',6-diamidino-2-phenylindole (DAPI, blue).
e, Procedure to label locally proliferating cells by retrovirus. f, Cells labelled by
retrovirus (green). g, Image of infected astrocytes. Astrocytes (BLBP⁺, red)
with GFP (GFP⁺BLBP⁺) or without GFP (GFP⁻BLBP⁺) in the outlined region
(dashed line) were included for analysis in h. RV, retrovirus. h, Percentages of
astrocytes labelled by retrovirus injected locally, calculated as
100 × (BLBP⁺GFP⁺ cells/BLBP⁺ cells). Scale bars, 200 µm (b), 50 µm
(d), 500 µm (f) and 40 µm (g).
viruses locally into layers I–IV of the barrel or motor cortex of wild-type
mice at P0–6 (Fig. 1e, f) and examined GFP expression 1 week
later in samples that were also stained with antibodies against brain 
lipid-binding protein (BLBP) (Fig. 1g), which labels radial glia during 
embryonic development and astrocytes in the postnatal brain’. 
Whereas about 30% of infected cells were NG2 glia (27.6%, n = 662 
infected cells, Supplementary Fig. 2), 55-70% of infected cells were 
astrocytes (BLBP⁺, 56.9%, n = 369 GFP⁺ cells, Fig. 1g; GFAP⁺,
68.6%, n = 662 GFP⁺ cells, Supplementary Fig. 2), indicating that
these astrocytes originated locally in vivo.

To determine whether a major astrocyte source was derived from 
the local generation of glia, we injected retroviruses with higher titre 
(1 µl, (1–3) × 10⁷) into the cortex of P0–2 mice and compared the
number of GFP-expressing astrocytes (BLBP⁺GFP⁺) with the total
number of astrocytes (BLBP⁺) within an infected region 7–10 days
Figure 2 | Properties of dividing cells within the cortex. a, Nuclei
(arrowheads, Hoechst 33342) of dividing cells at different mitotic stages in
acute brain slices. b, A dividing cell (differential interference contrast, arrow) at
prometaphase (Hoechst 33342, arrowhead). c–e, Voltage responses from an
Astro-like-D cell (c), a dividing NG2 glia (d) and a dividing SVZ cell
(e). f–i, Current responses from a non-dividing (ND) astrocyte (f), an
Astro-like-D cell (g), a dividing NG2 glia (h) and a dividing SVZ cell (i) evoked by step
voltages (inset, g). j, Current–voltage curves in f–i (the circles indicate the
positions of measurements). k, Astro-like-D cells (arrows; two at telophase, one
at metaphase) stained with anti-GFAP (red). l, m, Morphology of a non-
after injection. We found that nearly half of the astrocytes were doubly
labelled (GFP⁺BLBP⁺, 46.8 ± 3.8%, n = 5 mice; Fig. 1f–h). Because
the half-life of infectivity of MLV retrovirus is 5–8 h at 37 °C,
the doubly labelled astrocytes probably correspond to astrocytes
undergoing division in the time window of 5–8 h plus the progeny
they generate over the course of 7-10 days. Control studies revealed no 
differences in the morphology of astrocytes (Fig. 1g), the density of 
dividing cells (Supplementary Fig. 3) or the percentage of GFAP- 
occupied area (Supplementary Fig. 4) in brain regions with or without 
retroviral infection. Our observations therefore suggest that local pro- 
liferation is a major source of astrocytes in the postnatal cortex. 
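
The labelling index and its summary across animals can be computed as sketched below. The per-mouse cell counts are hypothetical placeholders, not data from the study. The percentage formula follows the Figure 1h legend, and the s.e.m. is assumed to be the sample standard deviation divided by the square root of the number of mice.

```python
from math import sqrt
from statistics import mean, stdev


def labelled_percentage(double_positive, blbp_positive):
    """100 * (BLBP+ GFP+ cells / BLBP+ cells) within one infected region."""
    if blbp_positive <= 0:
        raise ValueError("need at least one BLBP+ cell")
    return 100.0 * double_positive / blbp_positive


def mean_and_sem(values):
    """Mean and standard error of the mean (sample s.d. / sqrt(n))."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical (BLBP+ GFP+ count, BLBP+ count) pairs, one per mouse.
counts_per_mouse = [(30, 60), (24, 60), (27, 60)]

percentages = [labelled_percentage(g, b) for g, b in counts_per_mouse]
average, sem = mean_and_sem(percentages)

print(percentages)                          # [50.0, 40.0, 45.0]
print(f"{average:.1f} +/- {sem:.1f} %")     # 45.0 +/- 2.9 %
```

Treating each mouse as one observation keeps the error estimate tied to biological replicates rather than to the number of cells counted.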

To test the possibility that multiple dividing cell types infected by 
retroviruses, including astrocytes, NG2 glia and perhaps some 
unknown progenitors in the cortex, gave rise to these GFP-expressing 
astrocytes, we labelled acute brain slices with the nuclear marker 
dividing astrocyte (arrow) and an Astro-like-D cell (arrowhead, l) in the cortex,
and dividing cells (arrowheads) in the SVZ (m) of a P8 hGFAP-CreER;Ai14
transgenic mouse. n, Summary of the area covered by the processes of non-dividing
astrocytes (grey), Astro-like-D cells (red) and SVZ dividing cells (blue)
(10-µm z-projection with soma included, one-way analysis of variance followed
by a Bonferroni post-hoc test; two asterisks, P < 0.01). o, A non-dividing
astrocyte (arrow) was loaded with biocytin. A dividing astrocyte is labelled by
biocytin (arrowhead). tg, transgenic. Scale bars, 5 µm (a) and 10 µm
(b, k–m, o). Error bars indicate s.e.m.
Hoechst 33342, a dye that can permeate live cell membranes, to distinguish
dividing cells from non-dividing cells (Fig. 2a). In some
experiments we also used slices from CAG-Fucci-Green transgenic 
mice to identify dividing cells in the SVZ and cortex (Supplementary 
Fig. 5). In this line, the green fluorescent protein mAG accumulates 
specifically during the S (synthesis) to M (mitosis) stages of the cell 
cycle’, thus facilitating the identification of dividing cells for whole- 
cell patch-clamp recordings (Fig. 2b). Excluding cells of the vascular 
system, 94.6% (87 of 92 cells, P6-13) of the dividing cells in the cortex fell 
into two groups: dividing NG2 glia’*, with characteristic small sodium 
currents and rectifying current-voltage (I-V) curve (Fig. 2d, h, j) and 
astrocyte-like dividing cells (Astro-like-D; Fig. 2c, g, j), so named 
for their similarity to differentiated astrocytes (Fig. 2f, j). Astrocytes 
characteristically displayed large, delayed rectifier potassium currents 
(Kdr) and large, inwardly rectifying potassium currents (Kir) but no
sodium currents, and they had a linear I–V curve’? (Fig. 2f, j). In contrast,
dividing cells recorded in the SVZ (Supplementary Fig. 5) had no
Kir current and a very small Kdr (Fig. 2e, i, j), typical of immature
progenitors’®. To further characterize the Astro-like-D cells, we performed
immunostaining, which revealed that they were GFAP⁺ but Nestin⁻ (Fig. 2k
and Supplementary Fig. 6). We then compared the morphology of 


Figure 3 | Time-lapse imaging of local proliferation of astrocytes.
a–c, Proliferating astrocytes (arrowheads) in the cortex of hGFAP-GFP
transgenic mice, at P3 (a), P6 (b) and P14 (c). d, Summarized data for the
percentage of Ki67⁺GFP⁺ cells among GFP⁺ cells with strong GFP signals.
e, f, Sequential images of a cortical slice from a P3 hGFAP-GFP transgenic mouse
(e, parent cells (arrows); f, daughter cells (arrowheads)). g, Time-lapse images
Astro-like-D cells with that of mature astrocytes or SVZ dividing
progenitors in hGFAP-CreER;Ai14 transgenic mice. Crossing Ai14
transgenic mice’? with hGFAP-CreER transgenic mice’* allowed 
robust expression of the red fluorescent protein tdTomato after 
inducible astrocyte-specific Cre-mediated recombination. We administered
tamoxifen to hGFAP-CreER;Ai14 transgenic mice at P0–2
and assessed the cellular morphology 1 week later (Supplementary
Fig. 7). Astro-like-D cells (Ki67⁺tdTomato⁺) had a complex morphology
comparable to that of neighbouring mature astrocytes
(Ki67⁻tdTomato⁺; Fig. 2l, n and Supplementary Fig. 8). In contrast,
SVZ dividing progenitors had a bipolar/unipolar morphology
(Ki67⁺tdTomato⁺; Fig. 2m, n and Supplementary Fig. 8). Because
coupling by means of gap junctions is a hallmark feature of astrocytes, 
we injected biocytin into individual non-dividing astrocytes in brain 
slices of hGFAP-GFP transgenic mice, which express GFP under the 
control of human astrocyte-specific GFAP promoter”, and found that 
Astro-like-D cells (Ki67⁺GFP⁺) and mature astrocytes (Ki67⁻GFP⁺;
Fig. 2o) were coupled by means of gap junctions. Thus, unlike SVZ
dividing progenitors (Fig. 2e, i, j) and glioblasts”®, Astro-like-D cells 
in the cortex are differentiated astrocytes. Taken together with a previous
report” and our tracing results from NG2-CreBac/ER™;Ai14




(1 h 52 min) of a dividing GFP⁺ cell in e and f. h, Procedure to image cell division
in vivo. i–k, Images from a P4 triply transgenic hGFAP-CreER;Ai14;CAG-Fucci-Green
mouse (i, combined images; j, tdTomato; k, mAG signal;
arrowheads, dividing astrocytes). l, m, Time-lapse images at 1 h 35 min (l) and
18 h 58 min (m) from a P5 hGFAP-CreER;Ai14 transgenic mouse (arrowheads,
dividing astrocytes). Scale bars, 40 µm (a–g) and 100 µm (k, m).


©2012 Macmillan Publishers Limited. All rights reserved 


transgenic mice showing that NG2 glia generated very few astrocytes in 
the cortex during postnatal life (data not shown), our results reveal that 
Astro-like-D cells are the parent cells of locally generated astrocytes. 

To assess the abundance of proliferating astrocytes within the cor- 
tex, we perfused hGFAP-GFP transgenic mice for Ki67 immunostain- 
ing and observed numerous astrocytes in the process of cell division 
before P10 (18.9% at P3, n = 956 GFP⁺ cells; 13.1% at P6, n = 619
GFP⁺ cells; 1.5% at P14, n = 269 GFP⁺ cells; 0.30% at P48–52,
n = 1,684 GFP⁺ cells; Fig. 3a–d and Supplementary Fig. 9). To directly
monitor local generation of astrocytes, we performed time-lapse 
imaging of acute cortical slices from hGFAP-GFP transgenic mice 
and found that roughly 2% of astrocytes divided within 3 h
(2.0 ± 0.2%, 16 of 809 cells with strong GFP signals from five mice,
P3-5; Fig. 3e-g and Supplementary Movie 1). Because NG2 glia main- 
tain proliferative ability throughout life’*, and some hippocampal NG2 
glia are reported to have a very weak GFP signal in another hGFAP- 
GFP transgenic line”, we tested whether NG2 glia could have been 
among the dividing cells with a strong GFP signal, by loading biocytin 
into cells with a strong GFP signal through a recording pipette. We 
found that biocytin diffused only among cells with strong GFP signals 
(n =5 slices; Supplementary Fig. 10), which is consistent with our 
observations from doubly transgenic hGFAP-GFP;NG2BacDsRed 
mice (Supplementary Fig. 11). Thus, the dividing cells with strong 
GFP expression are astrocytes rather than NG2 glia. 

The density of dividing cells in acute brain slices showed no obvious 
change within a 6-h period after slice preparation (Supplementary Fig. 
12); however, mature astrocytes could conceivably be induced to 
undergo local gliogenesis by means of a stab wound in vivo”*. We 
therefore performed in vivo imaging with an open skull but an intact 
pial surface within 1 h after surgery on the triply transgenic hGFAP-CreER;Ai14;CAG-Fucci-Green
mice (Fig. 3h). We observed abundant
dividing astrocytes (12.4%, tdTomato⁺mAG⁺; Fig. 3i–k, P3–6), a
similar observation to that in brain sections (Fig. 3a, b, d). Because 
the thinned skull preparation does not cause astrocytic gliosis or 
activation of microglia, we then performed long-term time-lapse 
imaging with the thinned skull preparation in hGFAP-CreER;Ai14
transgenic mice and obtained similar results: abundant astrocytes were
generated locally within the cortex (about 8% of progeny, (8 × 2)/212
cells in Fig. 3l, m). Because of the difficulty of identifying a dividing cell
under thinned skull if its two daughter cells did not separate completely,
we probably underestimated the percentage of astrocytes produced on 
the basis of in vivo time-lapse imaging. 
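
The two time-lapse fractions quoted above are simple ratios and can be reproduced in a few lines. One assumption is labelled here: the in vivo numerator is read as eight observed divisions, each yielding two daughter cells, out of 212 cells. The helper name is hypothetical.

```python
def percent(part, whole):
    """Express part/whole as a percentage."""
    return 100.0 * part / whole


# Acute slices: 16 dividing cells among 809 strongly GFP-positive cells in 3 h.
slice_dividing = percent(16, 809)

# In vivo (assumed reading): 8 divisions x 2 daughters among 212 cells.
in_vivo_progeny = percent(8 * 2, 212)

print(f"{slice_dividing:.1f}%")   # 2.0%
print(f"{in_vivo_progeny:.1f}%")  # 7.5%
```

The in vivo value of 7.5% is what the text rounds to "about 8%".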

To determine whether dividing astrocytes in the cortex undergo sym- 
metric division to produce astrocytes, or asymmetric division to generate 
multiple cell types as SVZ cells do*”°, we recorded from their progeny 
during and shortly after cytokinesis (Fig. 4a, b). The two daughter cells 
shared similar I–V relationships that were characteristic of astrocytes
(Fig. 4c). In addition, we examined the daughter-cell morphology in
P6–8 hGFAP-CreER;Ai14 transgenic mice and found the two daughter
cells occupying comparable areas and showing similar labelling with the 
astrocyte marker, BLBP (Fig. 4d and Supplementary Table 1). Thus, 
locally dividing astrocytes in the cortex primarily undergo symmetrical 
division to generate two daughter astrocytes. 

To determine whether the progeny maintained their astrocytic fate 
after exiting from the cell cycle, we administered tamoxifen at P0–2 to
hGFAP-CreER;Ai14 transgenic mice to label astrocytes permanently
with tdTomato and examined their locally generated progeny 1 week
later (n = 4 mice) to test whether these tdTomato⁺ cells still expressed
the astrocyte marker BLBP. Although there were many progenitors or
neurons with tdTomato expression in SVZ and hippocampal dentate
gyrus (Supplementary Fig. 7f, g), nearly all of the tdTomato⁺ cells were
BLBP⁺ or GFAP⁺ in the cortex (99.8%, motor and barrel cortex;
Supplementary Fig. 7a–e). Because we found that few astrocytes would
enter programmed cell death in the cortex (Supplementary Fig. 13), it 
is most likely that the progeny arising from local astrocyte division 
retained astrocytic identity long after exiting from the cell cycle. For 
Figure 4 | Symmetric division of proliferating astrocytes and the function of
their progeny. a, A pair of daughter astrocytes (arrowheads) at late telophase
under differential interference contrast. b, Both cells had GFP signal. Nuclei
were stained with Hoechst 33342 (HO, inset). c, Voltage responses of two
daughter cells evoked by step currents (−1 to 6 nA). d, Two daughter cells in
telophase (arrowheads) from a P8 hGFAP-CreER;Ai14 transgenic mouse. e, An
astrocyte infected by GFP-expressing retroviruses (green) and expressing
tdTomato (red) formed endfeet (arrows) with blood vessels (Laminin⁺, purple)
in a P19 hGFAP-CreER;Ai14 transgenic mouse. Tamoxifen was injected at P2,
and cells were infected with retrovirus at P5. f, The percentage of progeny cells
marked by retroviruses (GFP⁺tdTomato⁺) that had endfeet (GFP⁺tdTomato⁺
with endfeet, blue). g, A retrovirus-infected astrocyte progeny (GFP⁺, green,
arrow) in the absence (Ctrl, upper) or presence (lower) of 100 µM
carbenoxolone (CBX) was injected with biocytin (red). Without CBX, both
GFP⁺ astrocytes (arrowheads) and GFP⁻ astrocytes (asterisks) contained
biocytin (red), as a result of gap-junction coupling with the astrocyte progeny
injected with biocytin. h, The number of cells coupled. Two asterisks, P < 0.01
(unpaired t-test). i, Current responses of uninfected (GFP⁻) and infected
(GFP⁺) astrocyte progeny. j, k, Glutamate transporter current (j) and its
summarized data (k) from infected astrocyte progeny before (black) and after
(red) application of blocker TBOA (100 µM, 70.4 ± 5.3%, n = 7). Two
asterisks, P < 0.01 (paired t-test). Scale bars, 10 µm (a, b, d, e) and 20 µm
(g). Error bars indicate s.e.m.


further confirmation, tamoxifen was administered at P0–2 and
retroviruses were injected locally at P3–5 (3 days after tamoxifen) in
the cortex of hGFAP-CreER;Ai14 transgenic mice. The fate of doubly
labelled cells (tdTomato⁺GFP⁺) was then assessed 2 weeks later.
Because tdTomato marked cells that had expressed astrocyte markers
before retroviral infection (Supplementary Fig. 7), the tdTomato⁺GFP⁺
cells correspond to the progeny of astrocytes that were infected by 
retroviruses during cell division. We found that all doubly labelled cells 
formed endfoot-like structures with blood vessels (38 of 38 yellow cells; 
Fig. 4e, f), a characteristic of differentiated astrocytes. These results 
demonstrate that locally generated progeny retain astrocytic identity 
long after they exit from the cell cycle. 

We then asked whether daughter astrocytes arising from local 
astrocyte division integrate functionally into the existing glial network 
as mature astrocytes. The intercellular communication by means of 
gap junctions between astrocytes has a critical function in ion buffering 
in the brain”®. At 1–3 weeks after viral infection, we loaded biocytin
into infected astrocytes and found that biocytin diffused into
neighbouring GFP⁻ astrocytes as well as some GFP⁺ astrocytes within
20 min (26.7 ± 2.9 cells, n = 10 slices; Fig. 4g–i and Supplementary
Fig. 14). The coupling was inhibited by carbenoxolone (CBX), a
blocker of gap junctions (1.9 ± 0.6 cells, n = 7 slices; Fig. 4g, h and
Supplementary Fig. 14), indicating that locally generated astrocytes 
successfully integrated into existing glial networks. Another classical 
function of astrocytes is to clear glutamate from the synaptic cleft”. 
After stimulation of neuronal fibres, an inward current with slow decay 
time course appeared in all the GFP⁺ astrocytes recorded (20 of 20
GFP⁺ cells, P12–19; Fig. 4j) and was sensitive to the glutamate
transporter blocker TBOA (Fig. 4j, k). The remaining currents, lasting
for more than 10 s (decay time 13.3 ± 0.4, n = 4; Fig. 4j), correspond
to Kir activation after neuronal excitation’®. These observations thus
reveal that locally generated astrocytes function as mature astrocytes to
take up glutamate and K⁺ ions after neuronal activity.

We have demonstrated that local generation of astrocytes within the 
postnatal cortex provides a major glial source, at least in layers I-IV, 
whereas astrocytes generated early in development are derived from 
radial glia*°, SVZ progenitors (including SVZ glioblasts and migratory 
glioblasts)*°"°. Once a subset of early astrocytes from those sources 
colonize and differentiate in the cortex as ‘pioneers’, local division of 
these differentiated astrocytes has a pivotal role in glial production 
after birth in the cortex. 

Astrocytic endfeet almost fully cover the blood vessels by postnatal 
day 20 and are crucial to the regulation of cerebral blood flow” and the 
transport of nutrients from blood to neurons”. It is not yet clear how 
this large number of locally generated astrocytes can coordinate with 
angiogenesis to form the complete gliovascular interface. Furthermore, 
aberrant gene activity affecting glial proliferation is one potential cause 
of gliomas, which comprise nearly 80% of primary malignant brain 
tumours’. It will also be of interest to test whether gliomas could have 
arisen from defective regulation of locally dividing glial cells in the brain. 


METHODS SUMMARY 


For live nuclear labelling, slices were incubated with Hoechst 33342 (2 µg ml⁻¹) as
described previously’. For in vivo imaging, the pups with thinned skull (Fig. 3h, l,
m) were immobilized with 4% agarose. The mouth of the pup was attached to a
1-ml pipette tip that was connected to a tube for inhalation. All data are given as
means ± s.e.m.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Animals and tamoxifen administration. The CAG-Fucci-Green transgenic line 
was from A.M.’s laboratory, the hGFAP-CreER line was from K.D. McCarthy’s 
laboratory (UNC), and the NG2BacDsRed transgenic line was from A. 
Nishiyama’s laboratory. Both NG2-CreBac*' and NG2-CreER were generated in 
Nishiyama’s laboratory and bought from Jackson Laboratory. The Ai14 transgenic
mice were from H. Zeng’s laboratory. Tamoxifen inductions were as described”.
For induction in hGFAP-CreER;Ai14 transgenic mice, an intraperitoneal or subcutaneous
injection of tamoxifen (dissolved in a 1:10 mixture of ethanol and
sunflower oil) at 3 mg per 40 g of body weight was administered once at the time 
indicated. All animals were treated in accordance with protocols approved by the 
Institutional Animal Care and Use Committee at UCSF. 

In vivo electroporation. Newborn to 2-day-old pups (P0-2) were anaesthetized 
by hypothermia (about 4 min) and fixed to a support with a sticking plaster. GFP 
complementary DNAs were cloned into the chicken β-actin CMV promoter-driven
expression vector pCAGGS. DNA solution (1–2 µl) prepared at 2 mg ml⁻¹
in 10 mM Tris-HCl pH 8.0, with 0.04% trypan blue, was injected into the lateral
ventricle with a pulled-out glass capillary (diameter 50–100 µm)*’. Animals were
subjected to five electric stimuli of 50 V, each lasting 50 ms, at 950-ms intervals 
using a square-pulse electroporator BTX830. 

Retroviral preparation and in vivo infection. pCAG-GFP-PRE contains 
replication-defective murine leukaemia virus (MLV)-based retroviral elements 
designed to carry and express enhanced GFP under CMV promoter and CAG 
promoter (modified chicken β-actin promoter with enhanced sequences from
CMV) with control of the MLV long terminal repeat. We followed the detailed 
protocol for preparation from Gage’s laboratory”’. In brief, three plasmids
(pCAG-GFP-PRE, pCMV-gp and CMV-vsvg) were transfected into HEK 293T cells with
Lipofectamine 2000. Virus-containing supernatants were harvested 2 days after
transfection by centrifugation twice at 65,000g for 2 h (Discovery 90SE; Sorvall).
Final virus titres were about 10⁶–10⁷ colony-forming units ml⁻¹ as measured by
infecting HEK 293T cells. Viruses with the GFP reporter gene were injected (1 µl)
into either C57BL/6 wild-type or hGFAP-CreER;Ai14 transgenic mice at P0–9. For
in vivo infection, pups were anaesthetized with ice for 3-5 min, and the injection 
was performed as described*’. After injection, the pups were put back in a cage 
with a lamp to keep them warm. They were returned to their home cage when fully 
recovered. 

Immunocytochemistry. Mice were perfused with 4% paraformaldehyde in
1× PBS. Brains were cut into sections 25–50 µm thick with a cryostat (model
CM3050S; Leica). Floating sections were permeabilized with 0.25% Triton
X-100 in 1× PBS and then blocked for 2 h with 5% BSA and 3% normal goat
serum with 0.25% Triton X-100 in 1× PBS. Primary antibodies for Ki67 (1:200
dilution, rabbit, monoclonal; Thermo Scientific), BLBP (1:1,000, rabbit, polyclonal;
Invitrogen) or Laminin (1:500, rabbit, polyclonal; Sigma) were applied to sections
alone or in combination and left to incubate for 24–48 h at 4 °C. Together with
DAPI or Hoechst 33342 (1 µg ml⁻¹; Invitrogen), secondary antibodies conjugated
with Alexa488, 555, 568 or 633 (1:750) were applied for 2 h at room temperature
(22–25 °C). To identify apoptotic astrocytes, sections were incubated for 15 min
with 1 µg ml⁻¹ propidium iodide after the treatment with 0.2 mg ml⁻¹ RNase
(DNase-free) in 1× PBS for 30 min at 37 °C as described previously****.

Slice preparation. Slices were prepared as described previously’. In brief, after 
decapitation, mouse brains were dissected rapidly and sliced with a vibratome 
(VT-1000S; Leica) in ice-cold oxygenated (95% O2 and 5% CO2) artificial
cerebrospinal fluid solution (aCSF) containing (in mM): 119 NaCl, 2.5 KCl, 2.5
CaCl2, 1.3 MgSO4, 1 NaH2PO4, 26.2 NaHCO3 and 11 glucose. Transverse slices
(250 µm in thickness) were then maintained in an incubation chamber for at least
1h at room temperature before whole-cell recording, nuclear dye loading or time- 
lapse imaging. 

Electrophysiology and live cell nuclear labelling. Whole-cell recordings from 
mouse brain slices were conducted with the aid of markers (GFP or 
Hoechst 33342) to identify infected cells or dividing cells. Astrocytes in hGFAP- 
GFP transgenic mice were identified by bright green fluorescence under the 
microscope. For live nuclear labelling, slices were incubated with Hoechst 33342
(diluted to 2 µg ml⁻¹ in aCSF) at room temperature for 30 min as described
previously’. Recording pipettes were routinely filled with a solution containing 
(in mM): 125 potassium gluconate, 15 KCl, 10 HEPES, 3 MgATP, 0.3 Na-GTP, 5
Na-phosphocreatine and 0.2 EGTA (pH 7.2–7.4, 290–300 mOsM). For glutamate
transporter currents, pipette solution contained (in mM): 125 caesium gluconate,
5 CsCl, 10 HEPES, 3 MgATP, 0.3 Na-GTP, 0.2 EGTA and 5 Na-phosphocreatine
(pH 7.2–7.4, 290–300 mOsM). Membrane potential in voltage-clamp mode was
held at −80 mV. Current pulses (20–60 µA, 0.1 ms, 0.05 Hz) were delivered
through extracellular bipolar electrodes placed roughly 200–300 µm from the cells
being recorded to induce transporter current. 

Biocytin labelling. Glial cells were filled with 0.1% biocytin (ε-biotinyl-L-lysine;
Vector Lab) by means of a whole-cell recording electrode, as reported previ- 
ously'>”*. Biocytin was dissolved in the recording pipette solution. Slices were fixed 
overnight with 4% paraformaldehyde at 4 °C before treatment for 2 h with block- 
ing solution containing 5% BSA, 3% normal goat serum and 0.25% Triton X-100. 
Slices were then stained for 2 h with DyLight 549-conjugated streptavidin (1:1,000; 
Vector Lab). In Fig. 2o, DyLight 549 was added together with Alexa 633 (secondary
antibodies against anti-Ki67) after washing out excess primary antibody against
Ki67. 

Confocal time-lapse imaging of acute brain slices. GFP⁺ cells in cortical slices
from hGFAP-GFP transgenic mice (P3–5) were imaged on a Zeiss LSM510 two-photon
confocal microscope equipped with 20×/0.5W and 63×/0.9W objectives
(Zeiss). Cells were scanned with xyz mode (four optical slices in z, with 8-µm
interval between slices). The frame interval was 4min for 30-100 frames. 
Projection images were made from z-stacks that included all individual GFP* 
cells. During imaging, slices were kept in a chamber with perfusion of aCSF (see 
above) at 32-34 °C. 
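
For planning purposes, the acquisition settings above translate into recording length and imaged depth as follows. This is a rough sketch with hypothetical helper names. It assumes the first frame is taken at time zero and that depth is measured from the first to the last optical plane.

```python
def recording_span_min(frame_interval_min, n_frames):
    """Elapsed time between the first and the last frame, in minutes."""
    return frame_interval_min * (n_frames - 1)


def stack_depth_um(n_slices, interval_um):
    """Distance between the first and the last optical plane, in µm."""
    return interval_um * (n_slices - 1)


# Slice imaging: one frame every 4 min, 30 to 100 frames per recording.
shortest = recording_span_min(4, 30)
longest = recording_span_min(4, 100)

# Slice imaging: four optical slices spaced 8 µm apart.
depth = stack_depth_um(4, 8)

print(shortest, longest)  # 116 396
print(depth)              # 24
```

Recordings therefore span roughly 2 to 6.5 hours, which is consistent with following individual divisions that take about 2 h from start to finish.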

Confocal time-lapse imaging in vivo. The pups (P3–6, hGFAP-CreER;Ai14
transgenic pups; Fig. 3l, m) were anaesthetized by hypothermia: 4–5 min in ice
wrapped in a piece of cloth. A small fraction of skin (3 mm × 3 mm) was removed
over an area to be imaged. The pups then were returned to a box for 3-4 h until the 
incision site healed (no bleeding). A high-speed micro-drill was used to thin a 
circular area of skull, typically about 1 mm in diameter. The mouth of the pup was 
attached to a 1-ml pipette tip that was connected to a tube for inhalation. Pups 
were then immobilized with 4% agarose. Imaging was performed using a two-photon
laser-scanning microscope based on a mode-locked laser system operating
at 930 nm, equipped with one of the following objectives: 10×, 0.25 numerical
aperture (NA); 20×, 0.8 NA. Emission was collected at more than 560 nm for tdTomato
and 500–550 nm for mAG. Sometimes tdTomato was excited with a laser at
543 nm. Images were taken every 1.5 h for the first 3 h, and then the pups were
put back in a box and allowed to move freely. Additional images were taken every
9–12 h for the following 18–24 h. During imaging, pups were fully anaesthetized
with 2–4% isoflurane for 4–5 min. Cells under the pia were scanned with xyz mode
(16 slices in z with 10-µm interval between optical slices). Shortly after an imaging
session, isoflurane was turned off and oxygen was left on until the animal fully
recovered. For Fucci-Green;hGFAP-CreER;Ai14 pups (Fig. 3i–k) we removed a
small fraction of the skull (1 mm²) and images were taken within 1 h after surgery.
Optogenetic stimulation of a hippocampal engram
activates fear memory recall 


Xu Liu, Steve Ramirez, Petti T. Pang, Corey B. Puryear, Arvind Govindarajan, Karl Deisseroth & Susumu Tonegawa


A specific memory is thought to be encoded by a sparse population of 
neurons’”. These neurons can be tagged during learning for sub- 
sequent identification’ and manipulation* *. Moreover, their ablation 
or inactivation results in reduced memory expression, suggesting 
their necessity in mnemonic processes. However, the question of suf- 
ficiency remains: it is unclear whether it is possible to elicit the beha- 
vioural output of a specific memory by directly activating a 
population of neurons that was active during learning. Here we show 
in mice that optogenetic reactivation of hippocampal neurons acti- 
vated during fear conditioning is sufficient to induce freezing beha- 
viour. We labelled a population of hippocampal dentate gyrus 
neurons activated during fear learning with channelrhodopsin-2 
(ChR2) and later optically reactivated these neurons in a different 
context. The mice showed increased freezing only upon light stimu- 
lation, indicating light-induced fear memory recall. This freezing was 
not detected in non-fear-conditioned mice expressing ChR2 in a 
similar proportion of cells, nor in fear-conditioned mice with cells 
labelled by enhanced yellow fluorescent protein instead of ChR2. 
Finally, activation of cells labelled in a context not associated with 
fear did not evoke freezing in mice that were previously fear- 
conditioned in a different context, suggesting that light-induced fear 
memory recall is context-specific. Together, our findings indicate that 
activating a sparse but specific ensemble of hippocampal neurons that 
contribute to a memory engram is sufficient for the recall of that 
memory. Moreover, our experimental approach offers a general 
method of mapping cellular populations bearing memory engrams. 

An important question in neuroscience is how a distinct memory is 
formed and stored in the brain. Recent studies indicate that defined 
populations of neurons correspond to a specific memory trace, suggesting 
a cellular correlate of a memory engram. Selective ablation or 
inhibition of such neuronal populations erased the fear memory response, 
indicating that these cells are necessary for fear memory 
expression. However, to prove that a cell population is the cellular 
basis of a specific fear memory engram it is necessary to conduct a 
mimicry experiment to show that direct activation of such a population 
is sufficient for inducing the associated behavioural output. 

The hippocampus is thought to be critical in the formation of the 
contextual component of fear memories. Modelling and experimental 
studies have demonstrated an essential role of the dentate 
gyrus (DG) of the hippocampus in discriminating between similar 
contexts. Cellular studies of immediate early gene expression showed 
that sparse populations of DG granule cells (2-4%) are activated in a 
given context. Moreover, although the same population of DG 
granule cells is activated repeatedly in the same environment, different 
environments or different tasks activate different populations of 
DG granule cells. These lines of evidence point to the DG as an ideal 
target for the formation of contextual memory engrams that represent 
discrete environments. 

To label and reactivate a subpopulation of DG cells active during the 
encoding of a memory, we targeted the DG of c-fos-tTA transgenic 
mice with the AAV9-TRE-ChR2-EYFP virus and an optical fibre 
implant (Fig. 1a). This approach directly couples the promoter of 
c-fos, an immediate early gene often used as a marker of recent neuronal 
activity, to the tetracycline transactivator (tTA), a key component of 
the doxycycline (Dox) system for inducible expression of a gene of 
interest. In our system, the presence of Dox inhibits 
c-fos-promoter-driven tTA from binding to its target tetracycline-responsive element 
(TRE) site, which in turn prevents it from driving ChR2-EYFP 
(enhanced yellow fluorescent protein) expression. In the absence of 
Dox, training-induced neuronal activity selectively labels active 
c-Fos-expressing DG neurons with ChR2-EYFP, which can then be 
reactivated by light stimulation during testing (Fig. 1b, c). We con- 
firmed that our manipulation restricts the expression of ChR2-EYFP 
largely to the DG area of the hippocampus (Fig. 1d-g). 
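
The gating logic described in this paragraph can be summarized as a small truth function. This is only an illustrative sketch: the function name and its boolean arguments are hypothetical, and real expression is graded rather than all-or-none.

```python
def chr2_eyfp_labelled(on_dox: bool, cfos_active: bool,
                       virus_present: bool = True) -> bool:
    """Idealized, all-or-none model of activity-dependent labelling.

    Assumption: expression is treated as a yes/no outcome. In practice
    labelling is graded and some background expression may occur.
    """
    # The c-fos promoter drives tTA only in recently active neurons.
    tta_expressed = cfos_active
    # Dox prevents tTA from binding its TRE target site.
    tta_binds_tre = tta_expressed and not on_dox
    # ChR2-EYFP requires the TRE-ChR2-EYFP virus plus TRE-bound tTA.
    return virus_present and tta_binds_tre
```

Under this simplification, a neuron is labelled only when it is active while the animal is off Dox, which is the window used for training.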

Figure 1 | Basic experimental protocols and selective labelling of DG cells by 
ChR2-EYFP. a, The c-fos-tTA mouse was injected with AAV9-TRE-ChR2- 
EYFP and implanted with an optical fibre targeting the DG. b, When off Dox, 
training induces the expression of tTA, which binds to TRE and drives the 
expression of ChR2-EYFP, labelling a subpopulation of activated cells (yellow) 
in the DG. c, Basic experimental scheme. Mice were habituated in context A 
with light stimulation while on Dox for 5 days, then taken off Dox for 2 days and 
fear-conditioned (FC) in context B. Mice were put back on Dox and tested for 5 
days in context A with light stimulation. d, Representative image showing the 
expression of ChR2-EYFP in a mouse that was taken off Dox for 2 days and 
underwent FC training. e-g, An image of each rectangular area in d is 
magnified, showing the DG (e), CA1 (f) and CA3 (g). The green signal from 
ChR2-EYFP in the DG spreads throughout entire granule cells, including 
dendrites (e), whereas the green signal confined to the nuclei in CA1 and CA3 is 
due to a 2-h half-life EGFP (shEGFP) expression from the c-fos-shEGFP 
construct of the transgenic mouse (f, g). Blue is nuclear marker 4′,6-diamidino- 
2-phenylindole (DAPI). Panel d is at ×10 magnification and panels e-g are at 
×50 magnification. 

First, to characterize the inducible and activity-dependent expression 
of ChR2-EYFP, we examined its expression timeline under 
various treatments (Fig. 2a-h). We observed a complete absence of 
ChR2-EYFP expression in DG neurons while mice were on Dox 
(Fig. 2a). Two days off Dox was sufficient to induce ChR2-EYFP 
expression in home-caged mice (Fig. 2b). The number of ChR2- 
EYFP-positive cells increased substantially in response to 2 days off 
Dox followed by fear conditioning (FC; Fig. 2c). We found that the vast 
majority of c-Fos-positive cells were also ChR2-EYFP positive 
(Supplementary Fig. 1), confirming that activity-dependent labelling 
with ChR2-EYFP recapitulated the induction of endogenous c-Fos. A 
similar increase in ChR2-EYFP expression was seen in a group of mice 
that were exposed to the same context and tone as the FC group but 
had no shocks delivered (NS; Fig. 2d). ChR2-EYFP expression lasted 
at least 5 days (Fig. 2e) and was gone by 30 days (Fig. 2f). Kainic- 
acid-induced seizures resulted in complete labelling of DG cells with 
ChR2-EYFP (Fig. 2g), indicating that the relatively sparse labelling in 

Figure 2 | Activity-dependent expression and stimulation of ChR2-EYFP. 
a-g, Representative images of the DG from c-fos-tTA mice injected with AAV9- 
TRE-ChR2-EYFP and killed after the following treatments: on Dox (a); off Dox 
for 2 days in home cage (b); same as b followed by FC (c); same as c except no 
shock was delivered (NS; d); same as c, 5 days after training (e); same as c, 30 
days after training (f); same as b followed by kainic acid injection to induce 
seizure (g). Residual green signal in a and f is from nuclear-localized c-fos- 
shEGFP (see Fig. 1 legend). h, Percentage of ChR2-EYFP-positive cells after 
various treatments represented by a-g (n = 5 subjects each; F(6,28) = 94.43, 
*P < 0.05; ***P < 0.001). N.S., not significant. i, j, Representative DG cells after 
light stimulation in c-fos-tTA mice injected with AAV9-TRE-ChR2-EYFP (i) or 
AAV9-TRE-EYFP (j). k, Percentage of c-Fos-positive cells among ChR2- 
EYFP-positive cells or EYFP-positive cells after light stimulation (n = 3 subjects 
each; ***P < 0.001). l, Light-evoked single unit activity of a DG neuron from a 
c-fos-tTA mouse injected with AAV9-TRE-ChR2-EYFP. Peri-event histogram 
(top) and raster plot (bottom) show reliable and precisely time-locked spiking 
relative to the onset of 15 ms light pulses (blue bar). Inset shows an overlay of 
waveforms for all the spikes during light stimulation. m, Spike probability and 
peak latency for all the light-responsive cells (n = 10) recorded as in l. n, Multi- 
unit activity in the DG from a c-fos-tTA mouse injected with AAV9-TRE- 
ChR2-EYFP in response to trains of 10 light pulses (15 ms; blue bars) at 20 Hz. 
Scale bar in a, 250 μm. Panels a-g are at ×10 magnification and panels i, j are at 
×80 magnification. Error bars show mean ± s.e.m. 

the FC or NS groups was not due to the low infection rate of the virus, 
but reflected the naturally low activity of DG neurons during the 
training sessions. Notably, NS and FC treatments resulted in 
similar proportions of ChR2-EYFP-positive cells (Fig. 2h). ChR2- 
EYFP expression after FC seemed to be restricted to the excitatory 
neurons, as no overlap was detected between ChR2-EYFP-positive neu- 
rons and GABA-positive inhibitory neurons (Supplementary Fig. 2). 

We injected c-fos-tTA mice with either AAV9-TRE-ChR2-EYFP or 
AAV9-TRE-EYFP, subjected them to FC while off Dox, and then put 
them back on Dox to test for light-evoked responses from DG cells the 
following day. The mice were anaesthetized for in vivo recordings, and 
blue light pulses (473 nm, 0.1 Hz, 15 ms pulse duration) were delivered 
to the DG. Consistent with the sparse labelling of DG neurons 
(Fig. 2h), we identified only ten DG neurons that responded to light 
stimulation from nine c-fos-tTA mice injected with AAV9-TRE- 
ChR2-EYFP (the ChR2 group). In these neurons, we detected a reliable 
increase of spike probability precisely time-locked to the onset of light 
pulses (Fig. 2l, m). These cells also showed robust responses to trains of 
20 Hz light stimulation, with a slight decrease in spike probability over 
time that remained above baseline (Fig. 2n). We did not find 
any light-responsive cells in the ten c-fos-tTA mice injected with 
AAV9-TRE-EYFP (the EYFP group; data not shown). Most of the 
ChR2-EYFP-positive cells in the ChR2 group of mice were also positive 
for endogenous c-Fos after optical stimulation, although not all c-Fos- 
positive cells expressed ChR2-EYFP. Very few neurons expressing 
EYFP in the EYFP group of mice were c-Fos positive (Fig. 2i-k and 
Supplementary Fig. 3). The proportion of c-Fos-positive cells in the 
downstream CA3 region was greater in the ChR2 group compared with 
the EYFP group after optical stimulation of DG neurons, and this 
number was comparable to the proportion of CA3 c-Fos-positive cells 
obtained by exposing a separate group of mice to the conditioned 
context after FC (Supplementary Fig. 4). 
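
The spike probability and latency measures reported for the light-responsive cells could be computed along the following lines. This is a minimal sketch, not the authors' analysis code (which was written in Matlab): the function name, the 15 ms response window and the use of mean first-spike latency in place of histogram-peak latency are all assumptions made for illustration.

```python
from typing import Optional, Sequence, Tuple


def light_response(spike_times: Sequence[float],
                   pulse_onsets: Sequence[float],
                   window_s: float = 0.015) -> Tuple[float, Optional[float]]:
    """Return (spike probability, mean first-spike latency in seconds).

    Spike probability is the fraction of light pulses followed by at least
    one spike within `window_s` of pulse onset (window length is assumed).
    """
    latencies = []
    for onset in pulse_onsets:
        # Latencies of all spikes falling inside the response window.
        in_window = [t - onset for t in spike_times
                     if 0.0 <= t - onset < window_s]
        if in_window:
            latencies.append(min(in_window))
    probability = len(latencies) / len(pulse_onsets) if pulse_onsets else 0.0
    mean_latency = sum(latencies) / len(latencies) if latencies else None
    return probability, mean_latency
```

With pulses delivered at 0.1 Hz, onsets are 10 s apart, so a short window of this kind cannot capture spikes belonging to a neighbouring pulse.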

Next, we tested whether activating a population of DG neurons 
labelled by ChR2-EYFP during the encoding of a fear memory was 
sufficient for memory recall. The experimental group (Exp) consisted 
of c-fos-tTA mice unilaterally injected with AAV9-TRE-ChR2-EYFP 
and implanted with an optical fibre targeting the DG (Fig. 1a). Mice 
were kept on Dox and underwent 5 days of habituation to record their 
basal level of freezing in one context (context A) during both light-off 
and light-on epochs. Next, they were taken off Dox and underwent FC 
in a distinct chamber (context B) in which a tone was paired with 
shock. The mice were then subjected to 5 days of testing with light-off 
and light-on epochs in context A while on Dox (Fig. 1c). During the 
habituation sessions, the Exp mice showed very little freezing during 
either light-off or light-on epochs. In contrast, after FC, freezing levels 
during light-on epochs were higher compared with light-off epochs, 
which indicated light-induced fear memory recall (Fig. 3a). Increased 
freezing during light-on epochs was observed over all 5 days of test 
sessions with no discernible day-dependent difference (Supplemen- 
tary Fig. 5g). These data suggest that DG cells that express endogenous 
c-Fos during training, and therefore become labelled by ChR2-EYFP, 
define an active neural population that is sufficient for memory recall 
upon subsequent reactivation. 
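
The comparison of light-off and light-on freezing can be illustrated with a short sketch. It assumes a hypothetical per-second binary freezing score for a 12-min session divided into four 3-min epochs (light off, on, off, on); the scoring resolution and all names are assumptions, since freezing in this study was scored by experimenters.

```python
from typing import Dict, List, Sequence

EPOCH_S = 180                               # 3-min epochs
EPOCH_LABELS = ("off", "on", "off", "on")   # minutes 1-3, 4-6, 7-9, 10-12


def freezing_by_epoch(frozen_per_second: Sequence[bool]) -> List[float]:
    """Percentage of time spent freezing in each 3-min epoch."""
    expected = EPOCH_S * len(EPOCH_LABELS)
    if len(frozen_per_second) != expected:
        raise ValueError(f"expected {expected} one-second scores")
    return [
        100.0 * sum(frozen_per_second[i * EPOCH_S:(i + 1) * EPOCH_S]) / EPOCH_S
        for i in range(len(EPOCH_LABELS))
    ]


def mean_by_light_state(per_epoch: Sequence[float]) -> Dict[str, float]:
    """Average the two light-off and the two light-on epochs separately."""
    means = {}
    for state in ("off", "on"):
        values = [p for p, label in zip(per_epoch, EPOCH_LABELS) if label == state]
        means[state] = sum(values) / len(values)
    return means
```

Averaging the two epochs of each light state mirrors the summary used for the within-group comparison; averaging across the 5 test days would be a further step not shown here.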

To rule out the possibility that post-training freezing by optical 
stimulation was due to the activation of DG cells unrelated to fear 
learning, we injected another group of mice (NS) with AAV9-TRE- 
ChR2-EYFP and administered the same habituation, training, and test 
sessions as the Exp group, except that no shock was delivered during 
the training session. Despite the fact that a similar level of ChR2-EYFP 
expression was detected in the NS group compared with the Exp 
group, both in terms of proportion of cells labelled (Fig. 2h) and 
ChR2-EYFP fluorescence intensity per cell (Supplementary Fig. 6), 
light did not induce post-training freezing in the NS group (Fig. 3b). 
This indicates that the freezing observed in the Exp group requires 
optical activation of a specific subset of ChR2-EYFP-positive DG cells 
that are associated with FC and that activating a population of DG cells 

Figure 3 | Optical stimulation of engram-bearing cells induces post-training 
freezing. a, c-fos-tTA mice injected with AAV9-TRE-ChR2-EYFP 
and trained with FC (Exp group) showed increased freezing during 3-min 
light-on epochs. Freezing for each epoch represents 5-day average (Supplementary 
Fig. 5a, g). Freezing levels for the two light-off and light-on epochs are further 
averaged in the inset (n = 12, F(1,22) = 37.98, ***P < 0.001). b, Mice trained 
similarly to the conditions in a but without foot shock (NS group) did not show 
increased light-induced freezing (n = 12). N.S., not significant. c, Mice injected 
with AAV9-TRE-EYFP and trained with FC (EYFP group) did not show 
increased light-induced freezing (n = 12). d, Mice trained similarly to the 
conditions in a but kept off Dox for 1 day before FC training (Exp-1day group) 
showed greater freezing during test light-on epochs compared to Exp group 
(n = 5, F(1,8) = 38.26, ***P < 0.001). e, Mice trained similarly to the conditions 
in a but bilaterally injected with AAV9-TRE-ChR2-EYFP and implanted with 
optical fibres (Exp-Bi group) showed even higher levels of freezing during test 
light-on epochs (n = 6, F(1,10) = 85.14, ***P < 0.001). f, Summary of freezing 
levels of the five groups during test light-on epochs (F(4,42) = 37.62, *P < 0.05; 
***P < 0.001). Error bars show mean ± s.e.m. 


not associated with FC does not induce freezing. Yet another group of 
mice (EYFP) were injected with AAV9-TRE-EYFP and underwent 
identical habituation, training, and testing sessions as the Exp group. 
The proportion of cells expressing EYFP was comparable to that seen 
in the Exp group expressing ChR2-EYFP (Supplementary Fig. 7). 
However, the EYFP group did not show increased post-training freez- 
ing (Fig. 3c). This result rules out the possibility that increased freezing 
in the Exp group was due to any non-specific effects of post-training 
optical stimulation. 

The light-induced freezing levels of the Exp group were relatively 
low (~15%) compared with those typically reported from exposure to 
a conditioned context (~60%). One possibility is that light activation 
of background-activity-induced ChR2-EYFP (Fig. 2b) interfered with 
the expression of the specific fear memory. We confirmed that limiting 
the off-Dox period from 2 days to 1 day reduced the background 
expression of ChR2-EYFP by at least twofold (compare Sup- 
plementary Fig. 8a home cage with Fig. 2h home cage). A group of 
mice (Exp-1day) that went through the same design outlined in Fig. 1c 
but with this modification showed greater freezing levels (~25%) 
during the light-on epoch of test sessions compared to the Exp group 
(Fig. 3d, f). Another possible factor contributing to the modest light- 
induced freezing in the Exp group may be the limited number of cells 
optically stimulated. To test this possibility, we bilaterally injected a 
group of mice (Exp-Bi) with AAV9-TRE-ChR2-EYFP and bilaterally 
implanted optical fibres targeting the DG, and then subjected these 
mice to the same scheme as that shown in Fig. 1c. During the light-on 
epochs of the test sessions, the Exp-Bi group exhibited levels of freezing 
(~35%) that were almost as high as those induced by the conditioned 
context (Fig. 3e, f, Supplementary Fig. 9 and Supplementary Movies). 

We next examined whether the light-induced fear memory recall was 
context-specific. First, to test whether two different contexts activate 
similar or distinct populations of DG cells, we took the mice off Dox 
for 2 days and then exposed them to a novel context (context C, an 
open field) to label the active DG cells with ChR2-EYFP. After being 
put back on Dox, the mice were fear-conditioned in a different context 
(context B) and killed 1.5 h later (Fig. 4a). The expression of ChR2- 
EYFP was used to identify cells previously activated in context C 
whereas endogenous c-Fos was used to identify cells recently activated 
in context B. Immunohistochemical analyses revealed a chance level of 
overlap between ChR2-EYFP-positive and c-Fos-positive cells, sug- 
gesting that two independent DG cell populations were recruited for 
the representation of the two distinct contexts (Fig. 4b-g). To test the 
context specificity of light-induced recall of a fear memory, we sub- 
jected a new group of mice (an open field fear-conditioned group; OF- 
FC) to habituation sessions in context A, followed by 2 days off Dox 
and exposure to context C to label neurons active in context C with 
ChR2-EYFP. Next, we put the mice back on Dox and performed FC in 
context B (Fig. 4h). These mice were then placed back in context A and 
tested for light-induced freezing. Light failed to evoke an increase in 
freezing responses (Fig. 4i). Similarly low levels of freezing were 
observed in another group of mice (FC-OF) in which FC in context 
B while on Dox preceded exposure to context C while off Dox 
(Supplementary Fig. 10). Together, these results indicate that light 
reactivation of cells labelled in context C did not induce fear memory 
recall associated with context B. 

Figure 4 | Labelling and stimulation of independent DG cell populations. 
a, c-fos-tTA mice injected with AAV9-TRE-ChR2-EYFP were taken off Dox 
and exposed to context C to label activated cells with ChR2-EYFP (yellow), 
then put back on Dox and trained with FC in context B to activate endogenous 
c-Fos (red). b-e, Representative images of DG from these mice are shown. 
b, ChR2-EYFP-labelled cells activated in context C. c, c-Fos-labelled cells 
activated in context B. d, Nuclear marker DAPI. e, Merge. The white and red 
circles show examples of ChR2-EYFP-positive and c-Fos-positive cells, 
respectively. The c-Fos-positive cells in e appear yellow because they express 
both endogenous c-Fos (red) and the nuclear-localized c-fos-shEGFP (green) 
(see Fig. 1 legend). f, Percentage of ChR2-EYFP-positive, endogenous c-Fos- 
positive, and double-positive cells among total cells (DAPI+) (n = 5). g, The 
observed percentage of double-positive cells is the same as what would be 
expected if the two cell populations were independent (that is, a product of the 
observed percentage of ChR2-EYFP single-positive and c-Fos single-positive 
cells). h, Behaviour setup for mice exposed to an open field in context C while 
off Dox and subsequently fear-conditioned in context B while on Dox (OF-FC). 
i, OF-FC mice (n = 5) do not show increased light-induced freezing. N.S., not 
significant. Panels b-e are at ×80 magnification. Scale bar in b, 10 μm. Error 
bars show mean ± s.e.m. 
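
The chance level of overlap referred to in panel g follows from the product rule for independent populations. A minimal sketch of that calculation is given below; the function name is hypothetical, and the percentages in the usage note are made-up illustrative values, not data from this study.

```python
def expected_overlap_percent(pct_chr2: float, pct_cfos: float) -> float:
    """Expected percentage of double-positive cells under independence.

    Both inputs are percentages of total (DAPI-positive) cells. If the two
    labels mark independent populations, the expected double-positive
    fraction is the product of the two single-label fractions.
    """
    return (pct_chr2 / 100.0) * (pct_cfos / 100.0) * 100.0
```

For example, if 4% of cells carried one label and 2% the other (illustrative numbers only), about 0.08% of cells would be expected to carry both by chance. An observed double-positive percentage close to this product is what indicates independent recruitment.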

We have shown that optical activation of hippocampal cells that 
were active during FC elicits freezing behaviour. To our knowledge, 
this is the first demonstration that directly activating a subset of cells 
involved in the formation of a memory is sufficient to induce the 
behavioural expression of that memory. Our results and previous 
studies that addressed the necessity of similarly sparse cell populations 
in the amygdala argue that defined cell populations can form a 
cellular basis for fear memory engrams. The memory engram that 
we selectively labelled and manipulated is probably contextual in 
nature, as previous studies have demonstrated that hippocampal inter- 
ventions affect conditioned freezing responses to a context but not a 
tone. Indeed, recent findings show that optogenetic inhibition of 
the hippocampal CA1 region during training or testing inhibited the 
recall of a contextual fear memory, while leaving auditory-cued fear 
memory recall intact. However, we cannot completely rule out the 
possibility that the fear memory recalled in our experiments may have 
some tone memory component. 

Our observation that freezing responses were elicited by optical 
stimulation in the experimental groups (Exp, Exp-1day and Exp-Bi), 
but not in the OF-FC or FC-OF group, strongly supports a dual 
memory engram hypothesis of contextual FC. In this hypothesis, 
hippocampal cells are recruited to form contextual memory engrams, 
but these contextual engrams alone do not represent a complete fear 
memory. For a fear memory to be formed, the information from the 
contextual memory engram must be transferred to the basolateral 
amygdala coincidentally with the information representing a foot 
shock. In the OF-FC or FC-OF scheme, two distinct contextual 
memory engrams were formed in the DG, which were represented 
by two distinct sets of DG cells. One of these two contextual engrams 
(the one for context B) was associated with the representation of the 
shock, but not the other engram (the one for context C). Because only 
the latter, but not the former, was labelled by ChR2, optical stimulation 
could not elicit fear memory expression. 

Although we have demonstrated the sufficiency of a DG memory 
engram for the behavioural expression of a fear memory, we cannot 
conclude that this engram is necessary for behavioural recall. During 
contextual FC, it is likely that multiple contextual memory engrams are 
formed in a series of hippocampal regions. Each of these engrams may 
contribute to the formation of the complete fear memory in the BLA 
and may also be capable of reactivating it independently, as we 
observed in the case of the DG engrams. Because the hippocampus 
is not a linear feed-forward network but contains several parallel 
circuits, inhibiting the formation or activation of contextual engrams 
in one region may not necessarily block the expression of the fear 
memory. For instance, disruption of contextual memory engrams in 
the DG could be circumvented by CA1 engrams, which could be 
generated through the direct input from the entorhinal cortex and 
may be sufficient to activate the fear memory engram in the BLA. 
Indeed, we recently generated a mouse mutant, which permitted us 
to demonstrate that the DG input to the CA3 is dispensable in the 
formation and retrieval of contextual fear memory. 

The approach and methods described in this work will be a powerful 
tool for mapping multiple component engrams, each contributing to 
an overall memory. A multifaceted analysis of these engrams and their 
interplay will reveal the nature of the overall memory engram. 


METHODS SUMMARY 

Virus-mediated gene expression. The pAAV-TRE-ChR2-EYFP and pAAV- 
TRE-EYFP plasmids were constructed by standard methods and packed as 
AAV9 viruses. The viruses were stereotaxically injected into the DG (0.15 μl). 
Immunohistochemistry. Mice were killed after various treatments and brain 
slices were prepared for immunohistochemistry and confocal microscopy. 
Coronal sections were immunostained for EYFP and/or c-Fos. All imaging and 
analyses were performed blind to the experimental conditions. 

In vivo recording. An optrode consisting of a tungsten electrode glued to a 
200 μm optical fibre coupled to a 473 nm laser was used for optical stimulation 
and extracellular electrical recordings in head-fixed, isoflurane-anaesthetized 
mice. 

Behavioural tests. Mice used for behavioural tests were injected with AAV9 virus 
and implanted with an optical fibre targeting the DG. All mice were habituated in 
context A while on 40 mg kg−1 Dox for 5 days for 12 min per day with light 
stimulation (473 nm, 20 Hz, 15 ms) during minutes 4-6 and 10-12. The groups 
of mice were taken off Dox for 1 (Exp-1day) or 2 days (Exp, Exp-Bi, EYFP and NS) 
and fear-conditioned in context B with a tone, with (Exp, Exp-1day, Exp-Bi and 
EYFP) or without (NS) shock. The OF-FC group was taken off Dox for 2 days and 
exposed to context C without shock, then fear-conditioned in context B while on 
Dox. The FC-OF group was fear-conditioned in context B while on Dox, then 
taken off Dox for 2 days and exposed to context C without shock. All groups were 
put back on Dox and tested in context A for 5 days in a manner similar to that used 
in the habituation sessions. All groups except the NS group were then returned to 
context B for a contextual fear probe trial 1 day after the last light stimulation, 
followed by a cued tone probe trial in context D the next day. Freezing levels were 
scored by experimenters blind to all treatment conditions. 
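
The stimulation schedule described above (20 Hz, 15 ms pulses during minutes 4-6 and 10-12 of each 12-min session) can be written out as pulse onset times. This is a sketch under stated assumptions: the epoch boundaries are taken as 180-360 s and 540-720 s, pulses are assumed to start exactly at epoch onset, and all names are hypothetical.

```python
from typing import List, Tuple

PULSE_HZ = 20            # pulse frequency during light-on epochs
PULSE_WIDTH_S = 0.015    # 15 ms pulse duration
# Assumed boundaries for minutes 4-6 and 10-12, in seconds from session start.
LIGHT_ON_EPOCHS_S: Tuple[Tuple[int, int], ...] = ((180, 360), (540, 720))


def pulse_onsets() -> List[float]:
    """Onset time (s) of every light pulse in one 12-min session."""
    onsets: List[float] = []
    for start, stop in LIGHT_ON_EPOCHS_S:
        n_pulses = (stop - start) * PULSE_HZ
        onsets.extend(start + i / PULSE_HZ for i in range(n_pulses))
    return onsets


def duty_cycle() -> float:
    """Fraction of each light-on epoch during which the laser is on."""
    return PULSE_WIDTH_S * PULSE_HZ
```

Under these assumptions each session contains 7,200 pulses (3,600 per light-on epoch), and the laser is on for 30% of each light-on epoch.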


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


Subjects. The c-fos-tTA mice were generated from TetTag mice by breeding them 
with C57BL/6J mice and selecting those carrying only the c-fos-tTA transgene and 
not the bi-cistronic tetO promoter driving tau-LacZ and tTA transgenes. 
These mice also contained a separate transgene consisting of a c-fos promoter 
driving the expression of nuclear-localized 2-h half-life EGFP (shEGFP), which 
is distinct from the whole-cell-localized ChR2-EYFP. Mice had food and water 
ad libitum and were socially housed until the beginning of the surgery. The mice 
were 8-12 weeks old at the time of surgery and had been raised on food containing 
40 mg kg−1 Dox for 4 weeks before surgery. Mice were single housed post-surgery 
and throughout the rest of the experiments. All procedures relating to mouse care 
and treatment conformed to the institutional and National Institutes of Health 
guidelines. 

Virus construct and packaging. The pAAV-TRE-ChR2-EYFP plasmid was constructed 
by cloning TRE-ChR2-EYFP into an AAV backbone using the SpeI 
restriction site at the 5′ terminus and the blunt end at the 3′ terminus of the insert. 
The pAAV-TRE-EYFP plasmid was constructed by removing the ChR2 fragment 
from the pAAV-TRE-ChR2-EYFP plasmid using NheI and AgeI restriction sites, 
blunting with T4 DNA polymerase, and self-ligation of the vector, which retained 
the ATG start codon of the EYFP gene from the ChR2-EYFP fusion gene. The 
recombinant AAV vectors were serotyped with AAV9 coat proteins and packaged 
by the Gene Therapy Center and Vector Core at the University of Massachusetts 
Medical School. Viral titres were 1 × 10^13 genome copy ml−1 for AAV9-TRE- 
ChR2-EYFP and 1.5 × 10^13 genome copy ml−1 for AAV9-TRE-EYFP. 
Stereotactic injection and optical fibre implant. All surgeries were performed 
under stereotaxic guidance. Mice were anaesthetized using 500 mg kg−1 avertin. 
The virus was injected using a glass micropipette attached to a 10 μl Hamilton 
microsyringe (701LT; Hamilton) through a microelectrode holder (MPH6S; WPI) 
filled with mineral oil. A microsyringe pump (UMP3; WPI) and its controller 
(Micro4; WPI) were used to control the speed of the injection. The needle was 
slowly lowered to the target site and remained for 10 min before the beginning of 
the injection. Mice for timeline studies and head-fixed electrophysiology recordings 
were injected bilaterally (−2.2 mm anteroposterior (AP); ±1.3 mm mediolateral 
(ML); −2.0 mm dorsoventral (DV)) with 0.15 μl AAV9 virus at a rate of 0.1 μl 
min−1. After the injection the needle stayed for five additional minutes before it was 
slowly withdrawn. The mice used for behaviour tests were unilaterally or bilaterally 
injected with the virus as described above. After withdrawal of the needle, a 
Doric patchcord optical fibre (200 μm core diameter; Doric Lenses) precisely cut to 
the optimal length was lowered above the injection site (−2.2 mm AP; ±1.3 mm 
ML; −1.6 mm DV). Three jewelry screws were screwed into the skull surrounding 
the implant site of each hemisphere to provide extra anchor points. A layer of 
adhesive cement (C&B Metabond) was applied, followed by dental cement 
(Teets cold cure; A-M Systems) to secure the optical fibre implant. A cap made 
from the bottom part of a 15 ml Falcon tube (for unilateral implant) or the top part 
of an Eppendorf tube (for bilateral implant) was inserted to protect the implant and 
the incision was closed with sutures. Mice were given 1.5 mg kg−1 metacam as 
analgesic and remained on a heating pad until fully recovered from anaesthesia. 
Mice were allowed to recover for 2 weeks before all subsequent experiments. All 
fibre placements (Supplementary Fig. 11) and viral injection sites were verified 
histologically. We included only mice with ChR2-EYFP expression 
limited to the DG, which led to the exclusion of two mice throughout the study. 
ChR2-EYFP and EYFP expression timeline. Fourteen days after surgery, sub- 
jects were either kept on Dox and immediately killed or taken off Dox for 1 or 2 days. 
The mice from the latter two groups were either killed with no further treatments 
(home cage), or underwent FC or NS protocols as described in the behaviour section 
below. After each treatment, mice were killed 1.5 h, 24h, 5 days or 30 days later, as 
described in the main text, and underwent immunohistochemistry procedures. 
For seizure experiments, mice were taken off Dox for 2 days and injected intraperitoneally 
with 20 mg kg−1 kainic acid. The mice were killed 6 h after the first 
behavioural onset of seizure. 

In vivo recording. Mice were anaesthetized by isoflurane inhalation and placed in 
the stereotactic system with anaesthesia maintained with 0.5-1% isoflurane 
throughout the recording. An optrode consisting of a tungsten electrode (1 MΩ) 
glued to an optical fibre (200 μm core diameter; Doric Lenses), with the tip of the 
electrode extending beyond the tip of the fibre by 500 μm, was used for simultaneous 
optical stimulation and extracellular recordings. The optrode was lowered to 
the dentate gyrus (−2.2 mm AP; 1.3 mm ML; −2.0 mm DV) using a hydraulic 
micromanipulator (Model 640; David Kopf Instruments). The optical fibre was 
connected to a 200 mW 473 nm laser (MBL F473; Opto Engine) and controlled by 
a function generator (33220A; Agilent Technologies). The power intensity of light 
emitted from the optrode was calibrated to about 9 mW, which was consistent with 
the power intensity used in the behavioural assays. To identify ChR2-labelled cells, 
light pulses of 15 ms were delivered at 0.1 Hz at the recording sites approximately 
every 5-10 μm throughout the DG. After light-responsive cells were detected, two 
types of light stimuli were tested: a 15 ms light pulse every 10 s and a train of ten 
15 ms light pulses at 20 Hz every 10 s. Unit activity was band-pass filtered (500 Hz- 
5 kHz) and acquired with an Axon Digidata 1440A acquisition system running 
Clampex 10.2 software. Data were analysed with custom software written in 
Matlab. After the recording, endogenous c-Fos expression was induced by deliver- 
ing two epochs of 3-min light stimulation (9 mW, 20 Hz, 15 ms), separated by 
3 min, to the DG, the same as in behavioural experiments (see below). Mice were 
killed and perfused 90 min later. 

Immunohistochemistry. Mice were overdosed with avertin and perfused trans- 
cardially with cold PBS, followed by 4% paraformaldehyde (PFA) in PBS. Brains 
were extracted from the skulls and kept in 4% PFA at 4 °C overnight. Fifty- 
micrometre coronal slices were taken using a vibratome and collected in cold 
PBS. For immunostaining, each slice was placed in PBST (PBS + 0.2% Triton 
X-100) with 5% normal goat serum for 1 h and then incubated with primary 
antibody at 4 °C for 24 h (rabbit anti-c-Fos 1:5,000, Calbiochem; rabbit anti- 
GABA 1:5,000, Abcam; chicken anti-GFP 1:500, Invitrogen). Slices then 
underwent three wash steps for 10 min each in PBST, followed by 1 h incubation 
with secondary antibody (1:200 Alexa Fluor 488 anti-chicken, Invitrogen; 1:200 
Alexa Fluor 568 anti-rabbit, Invitrogen). Slices were then incubated for 15 min with 
DAPI (1:10,000) and underwent three more wash steps of 10 min each in PBST, 
followed by mounting and coverslipping on microscope slides. 

Cell counting. To characterize the expression timeline of ChR2-EYFP and EYFP, 
the number of EYFP-immunoreactive neurons in the DG was counted from six 
coronal slices (spaced 120 µm from each other) per mouse (n = 5 for ChR2 group, 
n = 3 for EYFP group). Coronal slices were taken from dorsal hippocampus 
centred on coordinates covered by our optical fibre implants (−1.94 mm to 
−2.46 mm AP; Supplementary Fig. 11). Confocal fluorescence images were 
acquired on a Leica TCS SP2 AOBS scanning laser microscope using a ×20/0.70 NA 
oil immersion objective. The image analysis module Visiomorph DP 
within VIS (Visiopharm) calculated the number of ChR2-EYFP-positive or 
EYFP-positive cells per section by thresholding EYFP immunoreactivity above 
background levels and using DAPI staining to distinguish between nuclei. The 
analysis module also permitted isolation of only ChR2-EYFP-positive and EYFP- 
positive neurons by setting size and fluorescence thresholds to filter out nuclear- 
localized c-fos-shEGFP-positive cells. The cell body layer of DG granule cells was 
outlined as a region of interest (ROI) according to the DAPI signal in each slice. A 
similar protocol was followed for c-Fos-positive cell counts in DG and CA3, except 
a Cy3 filter was applied for the latter. For quantification comparisons, we used a 
one-way ANOVA followed by Tukey’s multiple comparisons using α = 0.05. Data 
were analysed using Microsoft Excel with the Statplus plug-in and Prism 
(GraphPad Software). 

To analyse the overlap between c-Fos and ChR2-EYFP-expressing or EYFP- 
expressing cells, a z-stack method was used in conjunction with ImageJ³⁰ to montage 
ten optical stacks (1 µm each, step size 10 µm) taken under a ×20/0.70 NA oil 
immersive objective. Separate GFP and Cy3 filtered images were digitally com- 
bined to produce composite images. Equal cutoff thresholds were applied to all 
captures to remove background autofluorescence. All imaging and analyses were 
performed blind to the experimental conditions. To quantify the expression levels 
of ChR2-EYFP per cell, an experimenter blind to each condition used ImageJ to 
calculate the fluorescence intensity signal as integrated density for ten randomly 
chosen DG cells per hippocampal slice (n = 3 slices per mouse, 5 mice per con- 
dition; Supplementary Fig. 6). 
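
For readers unfamiliar with the measure, integrated density is the sum of pixel intensities inside a region of interest, equivalently the ROI area multiplied by its mean grey value. The sketch below illustrates that definition in plain Python; the function name, the toy image and the mask are hypothetical and are not taken from the study, which used ImageJ for this step.

```python
def integrated_density(image, mask):
    """Sum of pixel intensities inside the ROI (area x mean grey value)."""
    total = 0
    area = 0
    for img_row, mask_row in zip(image, mask):
        for value, inside in zip(img_row, mask_row):
            if inside:
                total += value
                area += 1
    mean = total / area if area else 0.0
    return total, area, mean


# Toy 3 x 3 image and ROI; the values are made up for illustration only.
image = [[10, 12, 11],
         [13, 90, 85],
         [12, 88, 95]]
mask = [[0, 0, 0],
        [0, 1, 1],
        [0, 1, 1]]

int_den, area, mean = integrated_density(image, mask)
print(int_den, area, mean)
```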

Behaviour assays. All the behaviour tests were conducted during the light cycle of 
the day. Four different contexts were used in the behaviour assays. Context A was a 
30 × 25 × 33 cm conditioning chamber within a room with black walls, black 
curtains, and dim lighting. The chamber had a white plastic floor and was scented 
with 0.25% benzaldehyde. Context B was a 29 × 25 × 22 cm conditioning chamber 
within a second room with white walls and bright lighting. The chamber had a 
gridded floor and a triangular roof, and was scented with 1% acetic acid. Context C 
was a 41 × 41 × 31 cm unscented open field arena within a third room with white 
walls and intermediate lighting. Context D was a 29 × 25 × 22 cm conditioning 
chamber in the same room as context C. It had a white acrylic glass floor and was 
unscented. The experimental groups (Exp, Exp-1day and Exp-Bi) and EYFP 
control (EYFP) groups underwent exactly the same training protocol. During 
the habituation session, each mouse was introduced to context A daily for 5 days 
while on 40 mg kg⁻¹ Dox food. Each day the mouse was loaded into the chamber 
and the optical fibre implant was connected to a 473 nm laser (MBL F473; Opto 
Engine) controlled by a function generator (33220A; Agilent Technologies). The 
mouse was then allowed to explore the chamber for 12 min. The 12 min session 
was divided into four 3-min epochs, with the first and third epochs as the light-off 
epochs, and the second and fourth epochs as the light-on epochs. During the light- 
on epochs, the mouse received light stimulation (9 mW, 20 Hz, 15 ms) for the 
entire 3-min duration. At the end of the 12 min, the mouse was immediately 
detached from the laser and returned to its home cage. Following the fifth habitu- 
ation session, the mouse was kept on regular food without Dox for 1 (Exp-1day) or 
2 (Exp, Exp-Bi and EYFP) days until the training session. On the training day the 
mice received three training trials separated by 3 h in their home cages. For each 
training trial, the mouse was kept in the conditioning chamber in context B for 
500 s. A tone (20 s, 75 dB, 2,000 Hz) was turned on at 180 s, 260 s, 340 s and 420 s, 
each of which co-terminated with a foot shock (2 s, 0.75 mA). After the third 
training trial, the mouse was returned to its home cage and placed on food containing 
1 g kg⁻¹ Dox overnight to rapidly turn off any additional ChR2-EYFP or EYFP 
expression. The test session started the next day and the mouse was switched back 
to food containing 40 mg kg⁻¹ Dox. The procedure for the 5-day test session was 
exactly the same as the habituation session in context A. The day after the last test 
session, the mouse was returned to the original context B and exposed to the 
chamber for 300 s for a retrieval session to assay contextual fear memory. The next 
day, the mouse was introduced to context D for cued fear memory retrieval. This 
session lasted for 780 s, with a tone (60 s, 75 dB, 2,000 Hz) turned on at 180 s, 420 s 
and 660 s. The no shock (NS) group went through the same habituation, training 
and test sessions as the Exp group, except that no foot shock was given during the 
training session. The open field fear-conditioned (OF-FC) group went through the 
same habituation sessions. After the fifth habituation session, Dox was removed 
from the mouse’s diet for 2 days, followed by exposure to the open field arena in 
context C to allow for 10 min of active exploration. The mouse was subsequently 
returned to its home cage and placed on 1 g kg⁻¹ Dox food overnight. The following 
day, the mouse was fear-conditioned in context B in the same manner as described 
above. Test sessions were administered over 5 days in context A on 40 mg kg⁻¹ Dox 
food, and the OF-FC group also underwent context and tone probe trials after the 5 
days of testing. The fear-conditioned open field (FC-OF) group went through the 
same habituation sessions. The day after the fifth habituation, the mouse was kept 
on 40 mg kg⁻¹ Dox food and went through the FC procedure in context B as 
described above. The mouse was placed off Dox for 2 days after FC. Then the mouse 
was exposed to the open field arena in context C and allowed to freely explore for 
10 min, after which the mouse was returned to its home cage with 1 g kg⁻¹ Dox 
food overnight, followed by test sessions over 5 days in context A on 40 mg kg⁻¹ 
Dox food. Freezing behaviour for training, context, and tone probe trials was 
recorded with a digital camera and measured with FreezeFrame software 
(ActiMetrics). Light stimulation during the habituation and test sessions interfered 
with the motion detection of the program, and thus the freezing of these sessions 
was manually scored. Two experimenters scored each video independently in a 
double-blinded manner. The overall scores showed a <3% difference between the 
two experimenters and for simplicity only one set of scores from one experimenter 
was reported. The manual scoring and computer scoring of the same training videos 
gave similar freezing scores. For each group, within each session (habituation and 
test) and within each epoch (light-on and light-off), a one-way ANOVA with 
repeated measures followed by Tukey’s multiple comparisons (α = 0.05) revealed 
no difference over 5 days (Supplementary Fig. 5). We therefore averaged the freez- 
ing level over 5 days for each mouse. A two-way ANOVA with repeated measures 
followed by Tukey’s multiple comparisons (α = 0.05) revealed that only the experimental 
groups (Exp, Exp-1day and Exp-Bi) showed increases in averaged freezing 
levels for light-on epochs of test sessions compared to light-off epochs of test 
sessions and light-on epochs of habituation sessions (Fig. 3a, d, e). 
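
The session timing described in this section can be laid out explicitly. The following Python sketch, with hypothetical function names, derives the light-off and light-on epochs of a 12 min habituation or test session and the foot-shock onsets implied by a 2 s shock co-terminating with each 20 s tone. It illustrates the schedule only and is not the software used to run the experiments.

```python
def light_epochs(session_min=12, epoch_min=3):
    """Label consecutive epochs as light-off / light-on, starting with off."""
    epochs = []
    for i in range(session_min // epoch_min):
        start_s = i * epoch_min * 60
        state = "on" if i % 2 else "off"
        epochs.append((start_s, start_s + epoch_min * 60, state))
    return epochs


def shock_onsets(tone_onsets_s, tone_s=20, shock_s=2):
    """Shock onset times such that each shock ends together with its tone."""
    return [t + tone_s - shock_s for t in tone_onsets_s]


epochs = light_epochs()
shocks = shock_onsets([180, 260, 340, 420])
print(epochs)
print(shocks)
```

The last tone and shock end at 440 s, inside the 500 s training trial.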

Alternative messenger RNA splicing is the main reason that vast 
mammalian proteomic complexity can be achieved with a limited 
number of genes. Splicing is physically and functionally coupled to 
transcription, and is greatly affected by the rate of transcript 
elongation’’. As the nascent pre-mRNA emerges from transcrib- 
ing RNA polymerase II (RNAPII), it is assembled into a messenger 
ribonucleoprotein (mRNP) particle; this is the functional form of 
the nascent pre-mRNA and determines the fate of the mature tran- 
script*. However, factors that connect the transcribing polymerase 
with the mRNP particle and help to integrate transcript elongation 
with mRNA splicing remain unclear. Here we characterize the 
human interactome of chromatin-associated mRNP particles. 
This led us to identify deleted in breast cancer 1 (DBC1) and 
ZNF326 (which we call ZNF-protein interacting with nuclear 
mRNPs and DBC1 (ZIRD)) as subunits of a novel protein 
complex—named DBIRD—that binds directly to RNAPII. DBIRD 
regulates alternative splicing of a large set of exons embedded in 
(A + T)-rich DNA, and is present at the affected exons. RNA- 
interference-mediated DBIRD depletion results in region-specific 
decreases in transcript elongation, particularly across areas encom- 
passing affected exons. Together, these data indicate that the 
DBIRD complex acts at the interface between mRNP particles 
and RNAPII, integrating transcript elongation with the regulation 
of alternative splicing. 

The composition of mRNP particles has been the subject of a number 
of studies, using a variety of approaches (for example, ref. 5). There are 
likely to be different types of mRNP particles with distinct compositions 
and interaction partners. We sought to purify native mRNP particles 
and interacting proteins from the chromatin in which they are generated 
and in which they are active in co-transcriptional processes. As a starting 
point, we generated HEK293 cells expressing near-normal levels of Flag- 
tagged heterogeneous nuclear ribonucleoprotein (hnRNP) A1, an 
abundant hnRNP protein in human cells⁶. hnRNP A1 shuttles between 
the nucleus and the cytoplasm⁷, but at steady state it is mainly nuclear 
and concentrated in chromatin (Fig. 1a), from where it can be released by 
RNase A treatment (Fig. 1b, compare lanes 2 and 4). We used DNase I 
digestion and mild sonication to release mRNP particles from chromatin 
for purification. RNase inhibitors were present during the whole process, 
outlined in Fig. 1c. mRNP particles isolated by this approach are pre- 
dominantly of nuclear (chromatin) origin (Supplementary Fig. 1). 
Native mRNP particles and their interacting partners were purified from 
chromatin isolated from ~10* nuclei (Fig. 1d). Only two major bands 
(namely the added, proteinaceous RNase inhibitors) were detected after 
purification from control cells, whereas numerous proteins were 
detected in hnRNP A1-Flag elutions (Fig. 1d). These represent a 
heterogeneous mixture of core mRNP particle subunits and proteins 
interacting with such particles. Individual protein bands were excised 
and identified by mass spectrometric analysis (Fig. 1d; see also 
Supplementary Fig. 2a). Most of the known ‘core’ mRNP proteins, such 
as the hnRNP proteins, were present in the purified fraction, confirming 
the biological relevance of this approach. Many other pre-mRNA processing 
proteins were also identified, including splicing factors, ATP- 
dependent RNA helicases and a substantial number of mRNA 3′-end 
processing and termination factors. Co-immunoprecipitation experiments 
confirmed the RNA-dependent interaction of some of these 
proteins with hnRNP A1-Flag (Supplementary Fig. 2b). 

Figure 1 | Purification of nascent nuclear mRNP particles. a, Western blot 
analysis of cytoplasm (C), nucleoplasm (N) and chromatin (Ch), with α-tubulin, 
lamin B2 and histone H3 as controls for different fractions. b, Fractionation as in 
part a, but RNase A was added to the nuclear lysis buffer in the fractions 
indicated. c, Outline of the purification procedure. d, Equal fractions of the M2 
chromatography eluates from control (Mock) and hnRNP A1-Flag separated by 
4-12% SDS-PAGE and stained with SYPRO ruby. Asterisks mark RNase 
inhibitor proteins. Some of the identified proteins are indicated on the right. 

We next focused on two proteins that had not previously been 
connected to mRNP particles or mRNA processing. One of these, 
DBC1, is best known for its association with, and regulation of, the 
sirtuin-like deacetylase SIRT1 (refs 8, 9). We also investigated the 
uncharacterized zinc-finger-containing protein ZNF326. Stable cell 
lines expressing near-normal levels of Flag-tagged versions of these 
proteins were established, and co-immunoprecipitation experiments 
confirmed that both DBC1 and ZNF326 interact with mRNP particles 
in an RNA-dependent manner (Supplementary Fig. 3a-f). Furthermore, 
we discovered that ZNF326 and DBC1 associate directly, in an RNA- 
independent manner (Fig. 2a, e). For this reason, hereafter we refer to 
ZNF326 as ZNF-protein interacting with nuclear mRNPs and DBC1 
(ZIRD). 

We previously identified DBC1 as an RNAPII-interacting protein in 
another proteomic screen¹⁰, making it a particularly interesting candidate. 
Co-immunoprecipitation experiments confirmed that RNAPII 
associates with DBC1-Flag in an RNA-independent manner (Fig. 2b). 
Furthermore, ZIRD was detected in RNAPII (RPB3-Flag) purifica- 
tions, and this interaction was also RNA-independent (Fig. 2c). In 
further support of a ZIRD-RNAPII interaction, ZIRD-Flag also co- 
immunoprecipitated RNAPII (Fig. 2d). In contrast, we failed to detect 
an interaction between hnRNP A1 and RNAPII under the same conditions 
(Fig. 2c, middle panel, and data not shown), although co- 
immunoprecipitation experiments after formaldehyde crosslinking 
indicated that, as expected, the proteins are in close proximity in vivo 
(Supplementary Fig. 4). Together, these results indicate that DBC1 and 
ZIRD are not part of the core mRNP particle, but that they might work 
at the interface between the mRNP particle and RNAPII. 

Others reported that DBC1 interacts with SIRT1 (refs 8, 9). 
Although we observed that DBC1 co-precipitated SIRT1, endogenous 
ZIRD and ZIRD-Flag did not (Fig. 2e, and data not shown). SIRT1 is 
also absent from hnRNP-A1-containing mRNP particles (Supplementary 
Fig. 3g). This indicates that ZIRD and DBC1 form a complex that 
lacks SIRT1. To characterize the ZIRD-DBC1 interaction further, 
ZIRD-Flag was purified. Size-exclusion chromatography of highly 
purified material showed that ZIRD-Flag and DBC1 are part of a 
salt-stable ~800-kDa complex (Fig. 2f) that also co-purified on 
MonoQ (data not shown). As expected, SIRT1 is not part of this 
protein complex (Fig. 2g, and data not shown). We named it the 
DBC1-ZIRD complex (DBIRD). 

DBC1 and ZIRD interact with RNAPII in crude extracts (Fig. 2b-d). 
To investigate whether this interaction is direct, the DBIRD complex 
was characterized by gel filtration after mixing with an excess of 
RNAPII. In the absence of RNAPII, the DBIRD complex peaked in 
fractions 13-15 (Fig. 2f, upper two panels). However, when mixed with 
RNAPII (Fig. 2f, lower three panels), DBIRD complex elution shifted 
to earlier fractions, peaking in fraction 10 with a sub-fraction of 
RNAPII, whereas polymerase alone peaked in fractions 17-19 
(~500 kDa), as expected. The DBIRD complex thus seems to form a 
bridging complex that interacts with both mRNP particles and 
RNAPII. Interestingly, DBIRD also interacted with mRNP particles 
lacking hnRNP A1 (Fig. 2h), pointing to a general bridging role. 

To examine the role of the DBIRD complex in transcription- 
associated processes in vivo, we analysed the transcriptome of cells 
that had been depleted for DBC1 or ZIRD by RNA interference 
(RNAi) (Supplementary Fig. 5a). Total mRNA was hybridized to 
GeneChip Human Exon 1.0 ST arrays, on which the abundance of 
individual exons can be analysed independently. In the absence of 
ZIRD, a greater than 1.5-fold increase in exon inclusion was observed 
in more than 2,800 situations, whereas exon exclusion was observed in 
only 390 cases (Supplementary Table 1a). The absence of DBC1 led to 
increased inclusion of an exon in 796 cases (Supplementary Table 1b) 
and, notably, most of these events were also on the list of ZIRD- 
dependent exon inclusions (567 out of 796 (71%); P-value for shared 
exons = 6.705 × 10 7°'; Supplementary Table 1c), which strongly 
supports a close functional relationship between the two factors and 
provides confidence in the genome-wide alternative splicing data sets. 
The effect was at the level of alternative splicing, as depletion of ZIRD 
or DBC1 only affected the expression of a very small number of genes 
(Supplementary Fig. 6). 

Figure 2 | a, Western blot analysis of anti-Flag immunoprecipitates (IPs) from 
ZIRD-Flag cells. b, As in a, but for DBC1-Flag cells. c, As in a, but for 
RPB3-Flag cells. d, As in a, but detecting RPB1. e, Western blot analysis of 
anti-DBC1 and anti-ZIRD immunoprecipitates. IgG, immunoglobulin-γ. f, DBIRD analysed 
... two panels) RNAPII in approximately fivefold molar excess. Vo, void volume 
fraction. g, Silver stain of DBIRD. Asterisks indicate DBC1 and ZIRD 
degradation products. h, Western blot analysis of hnRNP-C-containing mRNP 
particles from mouse CB3 cells lacking hnRNP A1 (ref. 20). 

A full list of inclusion events observed in both DBC1- and ZIRD- 
depleted cells is in Supplementary Table 1c. Sample results were con- 
firmed by quantitative PCR with reverse transcription (RT-PCR) 
(Supplementary Fig. 7). To investigate whether DBIRD was present 
at affected exons, we performed RNA immunoprecipitation experi- 
ments''. DBC1 and ZIRD bound the relevant exon in mRNAs from 
seven tested genes, whereas other regions (or control transfer RNA) 
were not detected or detected to a much lower extent (Fig. 3a, b and 
Supplementary Fig. 8). Interestingly, some exons of the β-actin gene 
(whose splicing was unaffected by DBIRD depletion) had considerable 
levels of DBIRD complex as well (Supplementary Fig. 8), indicating 
that the interaction of DBIRD with mRNA is not invariably correlated 
with DBIRD-dependent splicing changes. 

To investigate the mechanism underlying exon inclusion, we first 
searched for sequence motifs in the DNA encompassing the included 
exons, but failed to uncover motifs other than those known to typify 
splice junctions. We then looked for nucleotide patterns that might be 
over-represented in the sequences surrounding the included exons by 
counting how often each of the 1,024 possible 5-base oligonucleotides 
occurred. Interestingly, (A + T)-rich 5-base oligonucleotides were 
markedly enriched around included exons (Fig. 3c). The frequencies 
of the four nucleotides in the regions around the splice sites were also 
analysed. A and T were strongly over-represented around the splice 
sites of DBIRD-affected exons, as well as across the exons themselves 
(Fig. 3d). The observed difference in A + T content is sufficient to 
explain the over-representation of (A + T)-rich 5-base oligonucleotides 
(Supplementary Fig. 9). 
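
The counting step described above is straightforward to reproduce in outline. The Python sketch below enumerates the 4^5 = 1,024 possible 5-base oligonucleotides, counts overlapping 5-mers in a sequence and computes its A + T fraction. The function names and the toy sequence are hypothetical, and the sketch does not reproduce the enrichment statistics used in the study.

```python
from collections import Counter
from itertools import product


def kmer_counts(seq, k=5):
    """Count overlapping k-mers in a DNA sequence (upper-case A/C/G/T)."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def at_fraction(seq):
    """Fraction of A or T bases in a sequence."""
    return sum(base in "AT" for base in seq) / len(seq)


all_5mers = ["".join(p) for p in product("ACGT", repeat=5)]
print(len(all_5mers))  # number of possible 5-base oligonucleotides (4**5)

# Made-up sequence, used only to show the calls.
toy = "ATTTTAAATGCATTTTA"
counts = kmer_counts(toy)
print(counts["ATTTT"], at_fraction(toy))
```

Comparing such counts between sequences around included exons and a background set is what would reveal an over-representation of (A + T)-rich 5-mers.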

The (A + T)-rich DNA surrounding the affected exons might 
influence fundamental aspects of transcription. Indeed, A- and 
T-tracts are difficult for RNAPII to transcribe, as they constitute very 
efficient elongation pause sites in vitro'*'*. To investigate the effect of 
DBIRD on transcript elongation, we performed RNAPII chromatin- 
immunoprecipitation (ChIP) analysis after DBIRD knockdown. For a 
control, we also knocked down SIRT1 (Supplementary Fig. 5b). 
Remarkably, although overall transcription of RAD50 and SLC36A4 
is not affected (see Supplementary Fig. 7), depletion of DBC1 or ZIRD 
(but not SIRT1) markedly affected RNAPII transcription distinctively 
in regions encompassing affected exons (Fig. 4 and Supplementary 
Fig. 11). Quantification of newly produced mRNA by bromo-UTP 
incorporation supported the idea that elongation rates were decreased 
in these regions (Supplementary Fig. 10). DBIRD depletion also affected 
RNAPII density at other genes whose splicing was exon-specifically 
affected, whereas little or no change in RNAPII density was observed 
at the unaffected β-actin control gene, even at exons that had an elevated 
DBIRD level (Supplementary Figs 11 and 12; compare to 
Supplementary Fig. 8). 

Our data support the idea that the DBIRD complex functions at 
the interface between core mRNP particles and RNAPII, affecting 
local transcript elongation rates and alternative splicing at a subset 
of (A + T)-rich exon-intron junctions (Supplementary Fig. 13). 
Notably, several studies have shown that the rate of RNAPII elonga- 
tion affects the efficiency of splicing, with slow elongation favouring 
exon inclusion’”. Therefore, one possible explanation for our data is 
that the DBIRD complex acts as an elongation factor that facilitates 
transcript elongation across (A + T)-rich regions, and thereby 
affects alternative splicing of exons in these regions. It has also been 
suggested that exons in the nascent pre-mRNA become tethered to the 
elongating transcription complex'*’’. Given that DBIRD binds both 
mRNPs and RNAPII, it might affect such tethering, and thereby affect 
splicing. 

DBC1 has been implicated in tumorigenesis as a potential tumour 
suppressor, regulating apoptosis and cell survival’®. Whether the role 
of DBC1 in the DBIRD complex and alternative splicing affects tumor- 
igenesis is an interesting possibility, particularly in light of the recent 
finding that genes encoding components of the splicing machinery are 
often mutated in myelodysplastic syndromes and related disorders”. 
ZIRD has not previously been characterized in human cells, but its 
mouse homologue, ZAN75, is highly expressed in neuronal tissues’®, 
suggesting that regulation of the DBIRD complex might contribute to 
tissue-specific splicing. Other proteins with homology to ZIRD and 
DBC1 exist in the human genome, raising the possibility that other 
DBIRD-like complexes are specific for other sets of genes or exons, or 
are involved in other transcription-related nuclear events. 

Figure 3 | DBIRD affects alternative splicing and is present at the affected 
exons. a, b, RNA immunoprecipitation (RIP) from crosslinked control (EV, ctrl), 
DBC1-Flag or ZIRD-Flag cells, analysed by quantitative PCR (qPCR). Control 
reactions lacking reverse transcriptase were always included (not shown). 
Error bars indicate standard deviations according to the Poisson statistic; 
n = 3. EV, empty vector; Ex, exon; Int, intron; Pr, promoter. Arrows indicate 
the start of transcription and the end of the coding region, respectively. 

Figure 4 | DBC1 and ZIRD link exon skipping to RNAPII transcription. 
a, RAD50 gene and qPCR primers (upper panel). RNAPII ChIP using cells 
transfected with control (scramble), DBC1, ZIRD or SIRT1 Stealth siRNAs 
(lower panel). ChIP signals were normalized with inputs. Signals in control cells 
were set to 1 at each position, and values obtained from factor-depleted cells 
were expressed relative to these signals. Error bars denote standard deviation; 
n = 3. b, Same as in a, but for SLC36A4. Supplementary Fig. 12 shows the same 
data in a format in which gene positional information is maintained. FR, 
flanking region. Arrows indicate the start of transcription and the end of the 
coding region, respectively. 
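
The two-step normalization stated in the Figure 4 legend (ChIP signal divided by input, then expressed relative to control cells at each primer position) can be written out as follows. This is a minimal sketch: the function name, the condition labels and the numbers are hypothetical and are not data from the study.

```python
def relative_chip_signal(ip, inputs, control="control"):
    """Normalise IP to input, then express each condition relative to control.

    ip and inputs map condition -> list of values, one per primer position.
    """
    norm = {c: [i / n for i, n in zip(ip[c], inputs[c])] for c in ip}
    return {
        c: [v / ref for v, ref in zip(vals, norm[control])]
        for c, vals in norm.items()
    }


# Made-up numbers for two primer positions; not data from the study.
ip = {"control": [2.0, 4.0], "siDBC1": [1.0, 6.0]}
inputs = {"control": [100.0, 100.0], "siDBC1": [100.0, 120.0]}
rel = relative_chip_signal(ip, inputs)
print(rel)
```

By construction the control is 1 at every position, so a value below 1 in depleted cells indicates lower RNAPII density there and a value above 1 indicates higher density.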


METHODS SUMMARY 

Open reading frames encoding hnRNP A1, DBC1 and ZIRD were cloned into 
pIRESpuro (Clontech) with a carboxy-terminal Flag tag. HEK293 cells were grown 
in Dulbecco’s Modified Eagle Medium (DMEM) containing 10% FBS in 5% CO₂ 
at 37 °C. For proteomic analysis, nuclei were isolated from hnRNP A1-Flag cells. 
These were sonicated and treated with DNase I, and the sample was cleared by 
centrifugation and the supernatant was subjected to M2 agarose (Sigma) chro- 
matography. Proteins were eluted with 3x Flag peptide and mass spectrometry 
was performed as has been described elsewhere'®. DBIRD was purified by M2 
agarose chromatography from nuclease-treated nuclear extract from cells expres- 
sing ZIRD-Flag. DBIRD was analysed by MonoQ, or size exclusion chromato- 
graphy with or without an excess of RNAPII. Stealth short interfering RNAs 
(siRNAs) were double transfected in HEK293 cells using lipofectamine 2000 
(Invitrogen). For microarray analysis, RNA was hybridized on Human Exon 1.0 
ST arrays (Affymetrix) using standard techniques (bioinformatics analysis 
described in Methods). For assessment of exon abundance and transcript expres- 
sion, quantitative RT-PCR was performed using primers against affected and 
unaffected exons. Primer details are given in Supplementary Table 2. RNA immu- 
noprecipitation and ChIP assays were performed as described’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Plasmids and antibodies. Open reading frames encoding human hnRNP A1, 
DBC1 and ZIRD (ZNF326) were cloned into pIRESpuro (Clontech) with a Flag 
tag at the C terminus. Antibodies used were rabbit anti-Flag, mouse anti-Flag M2 
and mouse anti-hnRNP C (Sigma); mouse anti-pCTD mAb 4H8 (Millipore); 
rabbit anti-lamin B2 (Acris); rabbit anti-ZNF326 and mouse anti-hnRNP A1 
(Santa Cruz Biotechnology); and rabbit anti-DBC1 and rabbit anti-SIRT1 
(Bethyl Laboratories). 

Cell culture, stable-cell-line establishment and stealth siRNA transfection. 
HEK293 cells were grown in DMEM containing 10% FBS in 5% CO₂ at 37 °C. 
To generate HEK293 cells stably expressing a Flag-tagged protein, cells were transfected 
with the relevant pIRESpuro construct and selected in 1 µg ml⁻¹ puromycin 
(Sigma). Cells were maintained in selecting media for 3 weeks, and surviving cells 
were used for experiments after transgene expression was checked. 

Stealth siRNAs were double transfected in HEK293 cells using lipofectamine 
2000 (Invitrogen) according to the manufacturer’s instructions. Protein and RNA 
expression was checked 48 h after the second transfection. Stealth siRNA anti- 
ZIRD RNAi sequences were: 5′-CGGAGGUAGUUAUGGUGGUCGAUUU-3′ 
(sense); 5′-AAAUCGACCACCAUAACUACCUCCG-3′ (antisense). Stealth 
siRNA anti-DBC1 RNAi sequences were: 5′-CCAUCUGUGACUUCCUAGAACUCCA-3′ 
(sense); 5′-UGGAGUUCUAGGAAGUCACAGAUGG-3′ (antisense). 
For stealth siRNA anti-SIRT1 we used validated stealth siRNA (Invitrogen; oligo 
ID VHS50609). For control siRNA we used Stealth siRNA negative control med 
GC (Invitrogen; 12935-300). 
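
As a quick consistency check on the sequences listed above, each antisense strand should be the reverse complement of its sense strand. The short Python sketch below, with a hypothetical helper name, performs that check on the ZIRD and DBC1 duplexes exactly as printed in this section.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq):
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# Sense/antisense pairs as listed in the text above.
pairs = {
    "ZIRD": ("CGGAGGUAGUUAUGGUGGUCGAUUU", "AAAUCGACCACCAUAACUACCUCCG"),
    "DBC1": ("CCAUCUGUGACUUCCUAGAACUCCA", "UGGAGUUCUAGGAAGUCACAGAUGG"),
}

for name, (sense, antisense) in pairs.items():
    print(name, len(sense), reverse_complement_rna(sense) == antisense)
```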

Immunopurification of native mRNPs. 10° cells stably expressing hnRNP A1- 
Flag were lysed with cytoplasmic lysis buffer (10 mM Tris-HCl (pH 7.9), 340 mM 
sucrose, 3 mM CaCl₂, 2 mM Mg(OAc)₂, 0.1 mM EDTA, 1 mM dithiothreitol 
(DTT), 0.5% tergitol-type NP-40, protease inhibitors and 1 µl ml⁻¹ RNasin 
Ribonuclease inhibitor (Promega)), and intact nuclei were pelleted by centrifugation 
at 3,500g for 15 min. Nuclei were washed with cytoplasmic lysis buffer 
without NP-40 and then resuspended in DNase I buffer (20 mM HEPES 
(pH 7.9), 10% glycerol, 1.5 mM MgCl₂, 1 mM DTT, protease inhibitors and 
1 µl ml⁻¹ RNasin). After ten strokes in a Dounce homogenizer, nuclei were sonicated 
using Bioruptor (Diagenode) before DNase I (Sigma) was added to the buffer and 
incubated for 30 min at room temperature. Buffer was then adjusted to a final 
concentration of 250 mM KOAc and 1% Triton X-100. The sample was cleared by 
centrifugation at 20,000g for 30 min and the supernatant was collected. For nega- 
tive control purification, the same extracts were prepared from the same amount of 
untagged cells. 

The sample was then applied to M2 agarose beads (Sigma) and incubated for 4 h 
at 4 °C. After binding, beads were washed extensively with washing buffer (20 mM 
HEPES (pH 7.9), 250 mM KOAc, 1% Triton X-100, 10% glycerol, 3 mM EDTA, 
1 mM DTT, protease inhibitors and 1 µl ml⁻¹ RNasin Ribonuclease inhibitor)’°. 
Finally, proteins were eluted by using Flag elution buffer (20 mM HEPES (pH 7.9), 
100 mM KOAc, 3 mM EDTA, 1 mM DTT, 200 µg ml⁻¹ 3× Flag peptide, protease 
inhibitors and 1 µl ml⁻¹ RNasin Ribonuclease inhibitor). Eluates were resolved by 
4-12% bis-Tris-gradient SDS-PAGE and revealed by SYPRO Ruby staining 
(Invitrogen). 

Mass spectrometric analysis. Protein samples were reduced, alkylated and 
digested with trypsin, using the Janus liquid handling system (PerkinElmer). 
The digests were subsequently analysed by liquid chromatography tandem mass 
spectrometry on an LTQ Orbitrap XL mass spectrometer (ThermoScientific). The 
resulting data were searched against a protein database (UniProt KB) using the 
Mascot search engine programme (Matrix Science)*'. All data were analysed 
manually. 

Purification of the DBC1-ZIRD complex. Nuclei from approximately 10° 
ZIRD-Flag cells were isolated, washed as above and then sonicated (Bioruptor, 
30-s on-off cycles, max intensity for 15 min). Nucleic acids were digested by 
adding 10,000 units per ml Benzonase (Novagen) and 30 µg ml⁻¹ RNase A 
(Sigma), and incubating at 4 °C for 1 h. The nuclear extract was then adjusted 
to a final concentration of 250 mM KOAc and the insoluble fraction was removed 
by centrifugation (20,000g for 30 min). The supernatant was used for Flag-M2 
chromatography. After extensive washes (60 column volumes of 20 mM HEPES- 
KOH (pH 7.9), 0.5% Triton X-100, 10% glycerol and 250 mM KOAc), bound 
proteins were eluted in the same buffer as above but containing 0.5 mg ml⁻¹ 3× 
Flag peptide (and 100 mM KOAc). The complex was then dialysed into buffer A 
(20 mM Tris-HCl (pH 7.9), 10% glycerol and 0.01% NP-40) containing 100 mM 
NaCl, before MonoQ PC 1.6/5 (GE Healthcare) or size exclusion (Superose 6 PC 
3.2/30; GE Healthcare) chromatography. Proteins were eluted from MonoQ by a 
salt gradient from 0.1 to 1 M NaCl in buffer A. For size-exclusion chromatography, 
samples were loaded in buffer B (20 mM HEPES-KOH, 0.01% NP-40, 10% glycerol 
and 250 mM KOAc) with or without pre-incubation on ice with a five- to tenfold 
molar excess of purified mammalian RNAPII, purified as described”. Fifty-microlitre 
fractions were collected, and sizes were estimated by running protein size 
markers (Biorad) in parallel. 

Microarray analysis. RNAs were extracted using RNeasy Kit (Qiagen), DNase-I- 
treated on the column, labelled and hybridized on Human Exon 1.0 ST arrays 
(Affymetrix) using standard techniques. Three independent experiments were performed 
and used as true triplicates for data analysis. We processed core-probe-level 
signals using robust multiarray average (RMA) implemented in APT (apt-1.10.0, 
Affymetrix) to generate quantile-normalized probe-set and gene-level signal esti- 
mates. Probe set to transcript cluster metagrouping was obtained from Affymetrix. 
We removed control probe sets from further analysis. For the gene-level analysis, 
genes displaying a coefficient of variation of less than 0.05 were assumed to be 
uninformative and were removed. We determined transcriptional effects (DBC1 
versus control, ZIRD versus control and DBC1 versus ZIRD) by linear model, 
moderating the t-statistics by empirical Bayes shrinkage. We selected differential 
genes using a 0.05 P-value threshold using a nested F method. The analysis was 
carried out using the limma package from Bioconductor version 2.3 (ref. 23). 

To identify putative alternative splicing events we first filtered probe sets to 
reduce false positive events. We removed all probe sets that did not localize to 
unique loci in the genome (Affymetrix annotation). We removed all probe sets 
from transcripts identified as not expressed in a given condition, as it is not 
possible to determine alternative splicing events against an untranscribed back- 
ground. We defined a transcript as not being expressed if less than half of its 
member probe sets had a detection P value (detection above background value 
(dabg)) of less than 0.05 across all replicates. We also removed probe sets display- 
ing a dabg value of greater than 0.05 in all replicates in both the conditions being 
considered. Genes with fewer than three probe sets after filtering were also removed 
from the analysis. To identify pairwise alternative splicing events we fitted 
transcript-cluster-specific linear models to probe set signal estimates and tested 
for significant interactions between each probe set and gene-level signal estimates 
across pairwise conditions using a 0.01 P-value threshold and a nested F method. 
In cases in which multiple probe sets mapped to a single exon, only probe sets that 
had significant interactions were included in the results. Once again the analysis 
was carried out using the limma package from Bioconductor. 

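
The two detection-based filters described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the pipeline used in the study (which relied on APT and limma): the function names and the array layout (rows are probe sets, columns are replicates) are hypothetical, and "across all replicates" is read here as requiring a dabg P value below 0.05 in every replicate.

```python
import numpy as np

def is_transcript_expressed(dabg, alpha=0.05):
    """Return True if at least half of the probe sets are detected.

    dabg: 2-D array of detection-above-background P values for one
    transcript cluster (rows = probe sets, columns = replicates).
    A probe set counts as detected only if P < alpha in every replicate
    (assumed reading of "across all replicates").
    """
    dabg = np.asarray(dabg, dtype=float)
    detected = (dabg < alpha).all(axis=1)
    # "Not expressed" means fewer than half of the probe sets are detected.
    return bool(detected.sum() * 2 >= detected.size)

def keep_detected_probe_sets(dabg_a, dabg_b, alpha=0.05):
    """Boolean mask of probe sets to keep for a pairwise comparison.

    A probe set is dropped when its dabg value exceeds alpha in all
    replicates of both conditions being compared.
    """
    both = np.hstack([np.asarray(dabg_a, dtype=float),
                      np.asarray(dabg_b, dtype=float)])
    return ~(both > alpha).all(axis=1)
```

A transcript cluster would pass on to the splicing analysis only if it is expressed in both conditions and retains at least three probe sets after masking.
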
Computational analysis of sequences around affected exons. To investigate 
sequence features that might explain the differential enrichment of certain exons 
after knockdown of ZIRD or DBC1, we selected a set of 505 exons that were 1.5- 
fold enriched in both the ZIRD and the DBC1 knockdowns and which were not 
the first or the last exon (to avoid including 5’ and 3’ untranslated regions in the 
following analyses). As a negative control set, we selected a further 3,877 exons that 
were unaffected after depletion of ZIRD and that were not the first or the last exon 
in the gene. We then prepared a positive and a negative sequence set for both the 5’ 
and 3’ splice sites. For the 5’ splice site, the positive set contained the regions from 
−200 nucleotides to +50 nucleotides around the 5’ splice sites of the 505 enriched 
exons. The negative control set contained corresponding regions for the 3,877 
unaffected exons. Similarly, the positive set for the 3’ splice site contained the 
region from −50 nucleotides to +200 nucleotides around the 3’ splice sites of the 
505 enriched exons, and the negative set contained the corresponding regions of 
the 3,877 unaffected exons. 
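
The window definitions above can be illustrated with a short sketch. It assumes a plus-strand sequence held as a Python string and 0-based, end-exclusive exon coordinates; the function name and the exact treatment of the boundary nucleotide are illustrative assumptions, not details given in the text.

```python
def splice_site_windows(seq, exon_start, exon_end):
    """Return (five_prime_window, three_prime_window) for one internal exon.

    seq: plus-strand genomic sequence (assumed layout).
    The exon occupies seq[exon_start:exon_end].
    """
    # 3' splice site: intron/exon boundary at the start of the exon,
    # 50 nt of upstream intron plus 200 nt into the exon and beyond.
    three_prime = seq[max(0, exon_start - 50):exon_start + 200]
    # 5' splice site: exon/intron boundary at the end of the exon,
    # 200 nt upstream of the boundary plus 50 nt of downstream intron.
    five_prime = seq[max(0, exon_end - 200):exon_end + 50]
    return five_prime, three_prime
```

Each window is 250 nucleotides long when the exon lies far enough from the ends of the sequence; minus-strand exons would need to be reverse-complemented first.
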

The P values in Supplementary Fig. 9 were obtained in the following way. 
Suppose ab to be a dinucleotide and i a position around a splice site. We calculated 
the frequency $P_{\mathrm{exp}}(i,ab)$ of ab that we would expect given the frequency of ab in the 
negative set, $P_{\mathrm{neg}}(i,ab)$, and given the frequencies in the positive ($P_{\mathrm{pos}}$) and negative 
($P_{\mathrm{neg}}$) sets of mononucleotide a at position i, and b at position i+1, respectively: 

$$P_{\mathrm{exp}}(i,ab) = P_{\mathrm{neg}}(i,ab)\,\frac{P_{\mathrm{pos}}(i,a)\,P_{\mathrm{pos}}(i+1,b)}{P_{\mathrm{neg}}(i,a)\,P_{\mathrm{neg}}(i+1,b)}$$

The P values for the observed number of dinucleotides ab at position i were 
calculated by approximating the binomial distribution with a normal distribution: 

$$P\ \text{value} = \sum_{k=K}^{N} \binom{N}{k}\, p^{k} (1-p)^{N-k} \approx \frac{1}{2}\,\mathrm{erfc}\!\left(\frac{K-Np}{\sqrt{2Np(1-p)}}\right)$$

where N is the number of sequences in the set of affected exons, K is the number of 
observed dinucleotides ab at position i, Np is the expected number of dinucleotides at 
i, $p = P_{\mathrm{exp}}(i,ab)$, k is the summation value, and erfc is the complementary error 
function. 
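
As a worked illustration of this calculation, the sketch below computes the expected dinucleotide frequency from the negative-set dinucleotide frequency and the positive- and negative-set mononucleotide frequencies, then evaluates the upper-tail normal approximation to the binomial with the complementary error function. The function and argument names are hypothetical, and the numbers in the usage note are invented for illustration only.

```python
import math

def expected_dinucleotide_freq(p_neg_ab, p_pos_a, p_pos_b, p_neg_a, p_neg_b):
    """Expected frequency of dinucleotide ab at position i in the positive set.

    Rescales the negative-set dinucleotide frequency by the ratio of
    positive to negative mononucleotide frequencies at i and i+1.
    """
    return p_neg_ab * (p_pos_a * p_pos_b) / (p_neg_a * p_neg_b)

def enrichment_p_value(K, N, p):
    """Normal approximation to the binomial upper tail P(X >= K).

    N: number of sequences, K: observed count, p: expected frequency.
    """
    sd = math.sqrt(N * p * (1.0 - p))
    return 0.5 * math.erfc((K - N * p) / (math.sqrt(2.0) * sd))
```

For example, with N = 505 sequences and an expected frequency of 0.1, an observed count equal to the expectation (50.5) gives a P value of 0.5, and larger observed counts give progressively smaller P values.
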

Quantitative RT-PCR. RNAs were extracted using RNeasy Kit (Qiagen) and 
DNase-I-treated on the column, and 1 µg of total RNA was retro-transcribed using 
random primers and a first-strand complementary DNA synthesis kit 
(Fermentas). Quantitative RT-PCR was performed using SYBR Green detection. 
Specific primers against the alternatively spliced exon and unaffected exons were 
designed to assess exon abundance and transcript expression. Sequences are 
available on request. 


Chromatin immunoprecipitation. ChIP assays were performed as described, 
using 4H8 antibody (Abcam) or IgG antibody as a negative control, and incuba- 
tion for 1 h with protein G/Herring sperm DNA. The precipitated DNA fragments 
were analysed using real-time PCR with SYBR Green detection. Input DNA was 
analysed simultaneously and used for normalization. 

RNA immunoprecipitation. RNA immunoprecipitation was performed essen- 
tially as described''**. Briefly, 10” HEK293 cells were crosslinked in 1% PFA for 
10 min, quenched with 0.125 M glycine for 5 min, washed twice in PBS and col- 
lected in lysis buffer (50 mM Tris-HCl (pH 8), 1% SDS, 10 mM EDTA, and 50 U 
per 500 µl RNasin and protease inhibitors). Samples were sonicated for 15 min at 
maximum power using Bioruptor (Diagenode) to obtain RNA fragments 200-600 
bases long, cleared by centrifugation and diluted tenfold in dilution buffer (20 mM 
Tris-HCl (pH 8), 150 mM NaCl, 2 mM EDTA, 1% Triton X-100, 50 U ml⁻¹ 
RNasin and protease inhibitors). Samples were pre-cleared by incubation with 
100 µl of IgG agarose beads (Sigma) for 2 h at 4 °C. Anti-Flag M2 Resin (Sigma) 
was added and samples were incubated for 4 h at 4 °C. Immunoprecipitates were 
washed three times in wash buffer 150 (20 mM Tris-HCl (pH 8), 150 mM NaCl, 
2 mM EDTA, 1% Triton X-100, 0.1% SDS, 50 U ml⁻¹ RNasin and protease 
inhibitors), once in wash buffer 500 (same as above, but with 500 mM NaCl), once 
in LiCl buffer (10 mM Tris-HCl (pH 8), 250 mM LiCl, 0.5% NP-40, 0.1% deoxycholate, 
1 mM EDTA, 50 U ml⁻¹ RNasin and protease inhibitors) and once in TE100 
buffer (TE containing 100 mM NaCl, 50 U ml⁻¹ RNasin and protease inhibitors). 
Immunocomplexes were eluted three times in 150 µl elution buffer (TE containing 
100 mM NaCl, 200 µg ml⁻¹ 3× Flag peptide and 50 U ml⁻¹ RNasin) and incubated 
for 30 min at 37 °C with 1 µl of proteinase K. Crosslinking was then reversed by 
adding 9 µl of 5 M NaCl and incubating the samples at 65 °C for 1 h. Nucleic acids 
were extracted with phenol chloroform, then precipitated with ethanol and any 
remaining DNA was eliminated with Turbo DNase (Ambion) treatment. RNAs 
were then reverse transcribed using random primers and the cDNAs were used for 
subsequent PCR reactions using relevant primers. 

Nuclear run-on analysis. Cells were rinsed in PBS, then in buffer A (20 mM 
Tris-HCl (pH 7.4), 5 mM MgCl₂, 0.5 mM EGTA, 25% glycerol and 1 mM 
phenylmethylsulphonyl fluoride) and permeabilized in buffer A containing 
0.02% Triton X-100 for 3 min at room temperature. The nascent RNA labelling 
reaction was carried out in buffer A containing 2 mM ATP, 0.5 mM GTP, 0.5 mM 
CTP, 0.2 mM BrUTP (Sigma) and 25 U ml⁻¹ RNasin (Promega) for 15 min at 
37 °C. In control reactions, normal UTP was used instead of BrUTP. After BrU 
incorporation, cells were rinsed twice in PBS and total RNA from nuclei of both 
labelled and control samples was isolated using TriPure Isolation Reagent (Roche). 
BrdU antibody (2 µg) (Sigma; also recognizes BrU) was pre-incubated with 20 µl 
of Protein G magnetic beads (Invitrogen) per experimental condition. RNA was 
then heated at 80 °C for 10 min and incubated with the beads at room temperature 
for 1 h with gentle shaking. The beads were washed five times in PBS containing 
0.1% polyvinylpyrrolidone and RNasin (20 U per 200 µl), the RNA bound to the 
beads was eluted and the contaminant DNA was eliminated with Turbo DNase 
(Ambion). RNA was then reverse transcribed using random primers and the 
cDNAs were used for subsequent PCR reactions using relevant primers. 

Differential positioning of adherens junctions is 
associated with initiation of epithelial folding 


Yu-Chiun Wang, Zia Khan, Matthias Kaschube & Eric F. Wieschaus 


During tissue morphogenesis, simple epithelial sheets undergo 
folding to form complex structures. The prevailing model under- 
lying epithelial folding involves cell shape changes driven by 
myosin-dependent apical constriction. Here we describe an 
alternative mechanism that requires differential positioning of 
adherens junctions controlled by modulation of epithelial apical- 
basal polarity. Using live embryo imaging, we show that before the 
initiation of dorsal transverse folds during Drosophila gastrulation, 
adherens junctions shift basally in the initiating cells, but maintain 
their original subapical positioning in the neighbouring cells. 
Junctional positioning in the dorsal epithelium depends on the 
polarity proteins Bazooka and Par-1. In particular, the basal shift 
that occurs in the initiating cells is associated with a progressive 
decrease in Par-1 levels. We show that uniform reduction of the 
activity of Bazooka or Par-1 results in uniform apical or lateral 
positioning of junctions and in each case dorsal fold initiation is 
abolished. In addition, an increase in the Bazooka/Par-1 ratio 
causes formation of ectopic dorsal folds. The basal shift of junctions 
not only alters the apical shape of the initiating cells, but also forces 
the lateral membrane of the adjacent cells to bend towards the 
initiating cells, thereby facilitating tissue deformation. Our data 
thus establish a direct link between modification of epithelial 
polarity and initiation of epithelial folding. 

The anterior and posterior dorsal transverse folds, or the dorsal 
folds, are epithelial folds that form on the dorsal side of the gastrulating 
Drosophila embryo at stereotypical locations coincident with the 
second and fifth stripes of Runt expression (Fig. 1a-f, Supplementary 
Movie 1 and Supplementary Fig. 1a). Whereas the anterior fold is 
eventually shallow and the posterior fold deep, the initial cell shape 
changes are similar in both and the underlying mechanisms appear to 
be cell-autonomous (Supplementary Movies 2, 3 and Supplementary 
Fig. 1b, c). 

We monitored cell shape changes using two-photon laser scanning 
microscopy in live embryos that express a membrane marker conju- 
gated with the green fluorescent protein (Resille-GFP, also known as 
P{PTT-un1}CG8668). Optical sectioning of embryos at the mid-sagittal 
plane reveals that two stripes of dorsal cells, each three to seven 
cells wide, narrow their apices and shorten cell length during early 
gastrulation, producing two clefts on the dorsal surface that represent 
the first morphological signs of dorsal fold formation (Supplementary 
Fig. 2a and Supplementary Movie 4, see also Fig. 4b for measurements 
of shortening). Cells that undergo apical narrowing retain dome-like 
apices (Supplementary Fig. 2b), contrasting with the flattened apical 
surface caused by apical constriction during Drosophila ventral furrow 
formation. 

We sought to identify dynamic cellular processes that precede cell 
shape changes. Unlike the canonical mode of epithelial folding in 
which spatially restricted activation of the molecular motor, myosin II 
(encoded by spaghetti squash), drives localized apical constriction to 
initiate tissue deformation, the basal levels of apical myosin remain 


low and constant across the dorsal epithelium throughout the course of 
dorsal fold initiation with infrequent bursts of myosin activity that do 
not differ between the initiating and neighbouring cells (Supplemen- 
tary Movie 5 and Supplementary Fig. 3a, b). These results indicate that 
the initiation of dorsal fold formation is not associated with differential 
myosin contractility. 

In contrast, E-Cadherin (encoded by shotgun), the core component 
of adherens junctions, shows a cell-type-specific change in its position- 
ing: in the initiating cells, junctions shift basally from the subapical 
regions where they are originally assembled, whereas in the neighbour- 
ing cells junctions maintain their original subapical positioning 
(Fig. 1g, Supplementary Movies 6 and 7 and Supplementary Fig. 3c). 
Simultaneous imaging of E-Cadherin-GFP and Resille-GFP reveals 
that basal shift of junctions can be observed as early as 300 s before the 
onset of gastrulation during the last phase of cellularization, which 
precedes the apical narrowing and cell shortening that occur 100- 
200s after the onset of gastrulation (Fig. 1h and Supplementary 
Movie 8). During this seven-minute interval, junctions in the initiating 
cells shift approximately 10 µm basally to lie at 34 ± 5% (n = 18) below 
the apical surface, whereas junctions in the neighbouring cells show 
only a slight shift (~3 µm) to lie at 15 ± 4% (n = 27) below the apical 
surface (Supplementary Fig. 4). The basal shift of junctions in the 
initiating cells increases the asymmetry in the junctional positioning 
on the opposite sides of the neighbouring cells that immediately flank 
the initiating cells. The lateral membrane of these cells becomes 
increasingly curved, correlating with the increased junctional 
asymmetry (Supplementary Fig. 5). 

If the apparent basal shift of E-Cadherin positioning reflects an 
actual movement or remodelling of the junctions, it should be asso- 
ciated with an increase in the volume and surface area above the 
junctions. To test this hypothesis, we measured the two-dimensional 
parameters of area and perimeter of the apical domain in the living 
embryos. As the junctions shift basally in the initiating cells, both of 
these parameters increase, consistent with a basal movement of the 
junctions within the cells (Supplementary Fig. 6). We corroborated 
these observations by developing computer software that recon- 
structs and quantifies three-dimensional cell shape in fixed embryos 
(Fig. 1i). As the cell length increases during the last phase of 
cellularization, the length, volume and surface area of the apical 
domain in the initiating cells all increase significantly more than 
they do in the neighbouring cells (Fig. 1j-l), indicating that the 
junctional shift is accompanied by an expansion of the apical 
domain and that mobility of the E-Cadherin complex underlies 
the apparent basal shift of the junctions. 

Adherens junctions are positioned to the subapical regions of the 
polarized epithelial cells by the concerted action of the scaffolding 
protein Par-3 (encoded by bazooka in Drosophila), the atypical protein 
kinase C (aPKC) and the MARK family kinase Par-1: apically localized 
aPKC and basal-laterally localized Par-1 restrict Par-3 to the subapical 
regions, where it directs junctional assembly. We found that the 

Figure 1 | Morphology and cellular dynamics during dorsal fold formation. 
a, b, Scanning electron micrographs of the dorsal surface in an early (a) and a late 
(b) Drosophila gastrula. c-f, Confocal mid-sagittal sections of Neurotactin 
(green) and Runt (red) immunofluorescence in an early (c) and a late (d) gastrula. 
e, f, A magnified view of the highlighted areas in c and d. AF, anterior fold; CF, 
cephalic furrow; PF, posterior fold; PMG, posterior midgut; R2, second stripe of 
Runt; R5, fifth stripe of Runt. g, Two-photon time-lapse mid-sagittal section of 
E-Cadherin-GFP. Arrows, junctions of initiating cells in the anterior (pink) and 
posterior (cyan) folds undergo basal shift. h, Two-photon time-lapse mid-sagittal 
section of E-Cadherin-GFP and Resille-GFP in a posterior fold-initiating cell 
with manual traces of cell outlines (green) and junctional position (red). Scale 
bars, 10 µm. i, Three-dimensional rendering of a posterior fold based on 
Neurotactin immunofluorescence with an initiating cell highlighted in grey (left 
panel, orange patches depict thresholded Bazooka staining). The Bazooka 
staining is used to subdivide the cell into the apical (orange) and basal (yellow) 
domains (right panel). j-l, Scatter plots of the average length, volume and surface 
area of apical domain against the average total cell length in a series of fixed late 
cellularizing embryos with solid and dashed trend lines for the initiating and 
neighbouring cells. Error bars indicate s.d. 


levels of Bazooka and aPKC are not differentially regulated across the 
dorsal epithelium and thus do not account for the observed junctional 
shift (Fig. 2a, Supplementary Movie 9 and Supplementary Fig. 7). In 
contrast, live imaging of Par-1-GFP shows that the levels of Par-1 in 
the presumptive initiating cells, although initially similar (~95%) to 
those in the neighbouring cells before the onset of junctional shift, 
reduce progressively during the last phase of cellularization to reach 
approximately 80% of its levels in the neighbouring cells as gastrula- 
tion commences (Fig. 2b, c and Supplementary Movie 10, n = 7). This 
differential modulation of Par-1 levels seems to require the anterior- 
posterior patterning system (Supplementary Fig. 8 and Supplementary 
Movie 11). To ask whether the reduction in Par-1 levels in the initiating 

Figure 2 | The dynamics of Bazooka and Par-1 during dorsal fold initiation. 
a, b, Two-photon time-lapse mid-sagittal section of Bazooka-GFP (a) or 
Par-1-GFP (b). Arrows and brackets, anterior (pink) and posterior (cyan) 
fold-initiating cells. Scale bars, 10 µm. c, A time-course analysis of Par-1-GFP levels 
in the initiating cells relative to those in the neighbouring cells (n = 7). 
d, A scatter plot of the average Bazooka positioning in the initiating cells 
normalized by that in the neighbouring cells against the average levels of 
Bazooka or Par-1 relative to their respective levels in the neighbouring cells with 
the corresponding trend lines. e, A scatter plot of the average Bazooka 
positioning along the apical-basal axis against the average Bazooka/Par-1 ratio 
within individual cells. Error bars indicate s.d. 


cells correlates temporally with the junctional shift, we quantified the 
levels of Par-1 in fixed embryos and determined the position of 
junctions using Bazooka staining. As Bazooka becomes more basally 
positioned in the initiating cells, their Par-1 levels also become lower, 
whereas the Bazooka levels remain constant (Fig. 2d). These analyses 
confirm our live imaging data and establish a correlation between the 
position of junctions and the ratio of Bazooka/Par-1 (Fig. 2e). 

This correlation suggests that Par-1 downregulation allows 
Bazooka to gradually localize more basally, which in turn directs basal 
repositioning of junctions. To test this hypothesis, we altered the levels 
of Bazooka and Par-1 to investigate the function of junctional 
positioning during the formation of dorsal folds. Uniform reduction 
of Bazooka activity by RNA interference (RNAi) causes accumulation 
of E-Cadherin-GFP at the edges between apical and lateral surfaces, 
resulting in an extreme apical positioning of junctions across the 
epithelium (Fig. 3a and Supplementary Movie 13), similar to embryos 
produced by the germline clones of a strong loss-of-function allele of 
bazooka (Supplementary Fig. 9). Conversely, in par-1 RNAi embryos, 
junctions are located in the lateral regions of all dorsal cells at an 
average position of 39 ± 8% below the surface, slightly more basal than 
the junctions in the initiating cells in the wild-type (Fig. 3b and 
Supplementary Movie 14, 30 cells from 3 embryos). Importantly, in 
both bazooka and par-1 RNAi embryos, the junctional positioning is 
uniform across the entire dorsal epithelium and in each case, the 
initiation of dorsal folds is abolished despite the normal appearance 
of junction and epithelial structure (75% for bazooka RNAi, n = 8; 
70% for par-1 RNAi, n = 10). Thus, dorsal fold formation seems to 

Figure 3 | Differential positioning of adherens junctions is necessary and 
overexpression of Bazooka can be sufficient for ectopic dorsal fold 
initiation. a-d, Two-photon time-lapse mid-sagittal section of E-Cadherin-GFP 
in a bazooka (a) or par-1 (b) RNAi embryo or Bazooka[S151A S1085A]-GFP in 
an embryo in which the endogenous Bazooka is present (c) or downregulated 


require a differential positioning of junctions between the initiating 
cells and their neighbours. 

Par-1 phosphorylates and thereby excludes Bazooka from the basal- 
lateral regions of a polarized epithelial cell. We examined the behaviour 
of Bazooka[S151A S1085A], a mutant form of Bazooka that cannot be 
phosphorylated by the Par-1 kinase. When the endogenous Bazooka 
is present, the GFP-tagged Bazooka[S151A S1085A] shows a subapical 
(junctional) distribution similar to the GFP-tagged wild-type form 
(Fig. 3c and Supplementary Movie 15). However, when we knocked 
down the endogenous Bazooka using RNAi, Bazooka[S151A S1085A] 
initially shows a broad distribution along the apical-basal axis and 
eventually coalesces in the lateral regions of all dorsal cells (Fig. 3d 
and Supplementary Movie 16). A similar localization was observed for 
wild-type Bazooka-GFP in par-1 RNAi embryos (Supplementary 
Movie 17) and in both cases, dorsal fold formation is blocked. These 
results indicate that serine 151 and 1085 of Bazooka are two main 
substrates of Par-1 during dorsal fold initiation, whose differential 
phosphorylation determines the heterogeneous positioning of 
Bazooka across the dorsal epithelium and is critical for dorsal fold 
initiation. 

When we altered the ratio of Bazooka/Par-1 by a uniform increase 
in Bazooka levels throughout the epithelium, we saw shifts of junctions 
that lead to eventual formation of epithelial folds in regions that are 
outside the sites of anterior and posterior folds and typically near the 
third and seventh stripes of Runt expression (Fig. 3e, Supplementary 
Movie 18 and Supplementary Fig. 10). An increase in the Bazooka 
levels alone in the cells that normally maintain a subapical positioning 
of junctions can thus be sufficient to drive junctional shift and epithe- 
lial folding, presumably by exploiting subtle local heterogeneities in 
Par-1 levels (Supplementary Fig. 11). 

In most epithelia, aPKC phosphorylates Bazooka and becomes segregated 
to establish the apical domain above the junctions. We 
asked whether aPKC plays a role during junctional repositioning. In 
embryos that lack aPKC activity, the basal margin of the junctions 

by bazooka 5'UTR RNAi (d). e, Confocal mid-sagittal section of Bazooka (red), 
Neurotactin (blue) and Runt (green) immunofluorescence in a Bazooka-GFP 
overexpression embryo. R2, R3, R5 and R7 denote the Runt stripes. AF, anterior 
fold; PF, posterior fold. Scale bars, 10 µm. 


shows its characteristic basal shift in the initiating cells, but the apical 
margin unexpectedly maintains its typical subapical positioning, lead- 
ing to an abnormally wide junctional domain. In contrast, the width 
and positioning of the junctions in the neighbouring cells appear 
normal (Fig. 4a and Supplementary Movie 19). These results indicate 
that aPKC controls the apical margin to maintain the size of the junc- 
tions, but is not required for the basal shift of junctions. These obser- 
vations also decouple the junctional shift from an increase in the size of 
the apical membrane. The widening of junctional expanse was also 
observed in embryos that overexpress Bazooka[S980A], a mutant form of 
Bazooka that cannot be phosphorylated by aPKC (Supplementary 
Fig. 12). It seems that the segregation of aPKC from Bazooka 
establishes the apical domain, enabling junctional disassembly at the 
apical margin of the junctions. 

Figure 4 | Loss of aPKC results in an expansion of the junctional domain 
and a failure to shorten the initiating cells. a, Two-photon time-lapse images 
of E-Cadherin-GFP in an initiating (top) and a neighbouring cell (bottom) in 
an aPKC mutant embryo. Scale bar, 5 µm. b, A time-course analysis of 
normalized cell length of the initiating cells in the wild-type (n = 5) and aPKC 
mutant (n = 4) embryos. Error bars indicate s.d. 

Although the basal shift of junctions occurs in the initiating cells in 
the aPKC mutant embryo, these cells fail to shorten and the dorsal 
folds do not form properly (Fig. 4b and Supplementary Movie 19). It 
seems that as the basal margin of the junctions shifts basally in the 
initiating cells in response to a decrease in Par-1 levels, their apical 
margin needs to become disassembled in an aPKC-dependent manner 
so that the subsequent apical cell shape changes could occur. 

In this report, we present evidence that dorsal fold initiation 
requires the establishment of distinct ratios of Bazooka/Par-1 that 
impose different positions for the adherens junctions in the initiating 
cells and their neighbours. We propose that the differential positioning 
of junctions facilitates epithelial folding through two cellular processes 
(Supplementary Fig. 13). Within the initiating cells, the resultant 
increase in the non-adherent apical surface after junctional shift may 
be unstable such that a shrinkage of the apical domain is triggered to 
restore the balance between cell surface tension and local adhesive 
forces. The shortened cells thus produced would then create a localized 
structural inhomogeneity in the epithelium where buckling 
would preferentially occur. Second, in the immediate flanking cells, 
a junctional asymmetry is produced because the basal positioning in 
the initiating cells on one side and the subapical positioning in the 
neighbouring cells on the other must be accommodated. Because all 
junctions in an epithelium are mechanically coupled, the asymmetry 
may cause the lateral surfaces to curve and cells to bend towards the 
shortened initiating cells. This bending would drive and deepen any 
buckles or folds initiated in the epithelial sheet. 

Directional movement of the cadherin complex along the apical- 
basal axis has been observed previously in cultured cells in vitro but, 
to our knowledge, Drosophila gastrulation provides the first case where 
such movement has been described in an intact developing organism. 
When the shifts occur in stripes as they do on the dorsal side of the 
Drosophila embryo, they seem to initiate infolding of the epithelium. 
In tissues in which the levels of cortical myosin are low and constant, 
junctional repositioning regulated by Par-1/Bazooka interactions may 
play a more prominent role in epithelial folding than does differential 
activation of cortical contractility. Junctional repositioning may also 
represent an important mechanism in folding events that do not lead 
to internalization or delamination, or where the integrity of junctions 
within the epithelia must be maintained. How junctions are reposi- 
tioned while maintaining junctional integrity is unclear, but in 
principle the process could involve remodelling via local endocytic 
trafficking, or lateral movement of the intact junctions in the membrane. 
Regardless of the mechanism, dorsal fold formation represents 
an emergent model in which the insights into this alternative mode of 
epithelial folding could be further analysed. 


METHODS SUMMARY 


Detailed information about reagents and methods used in this paper, including the 
Drosophila stocks, RNAi, live imaging, immunofluorescence, scanning electron 
microscopy, image processing, three-dimensional cell boundary reconstruction 
and image quantification is described in Methods. 

Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


Drosophila genetics. Drosophila stocks used for live imaging were: Resille-GFP, 
myosin-GFP (Spaghetti squash-GFP), membrane-mCherry (also known as 
P{sqh-mCherry.membrane}), E-Cadherin-GFP, Par-1 protein trap (also known 
as P{PTT-GC}par-1), mat-tub-GFP-Par-1 (ref. 19). UASp-Bazooka-GFP 
was driven maternally by one copy (67C) of the matαTub-Gal4VP16 driver in live 
imaging experiments and one or two copies (67C; 15) in overexpression experiments. 
UASp-Bazooka[S151A S1085A]-GFP and UASp-Bazooka[S980A]-GFP were 
driven by matαTub-Gal4VP16 (67C; 15). Mutant stocks used were: runt”, 
torso’®/torso’™*', torso-like'/torso-like’, bicoid®! nanos®™ torso-like?. Germline 
clones of bazooka®?”! and aPKC*°™”? were generated using the FLP-recombinase/ 
dominant female sterile system with the ovo[D] FRT[19A] or FRT[G13] ovo[D] chromosome 
(FRT[19A] and FRT[G13] are also known as P{neoFRT}19A and P{FRT(w[hs])}G13). 

RNAi. Double-stranded RNAs were synthesized using a MEGAscript T7 kit 
(Ambion) from PCR products that contain the T7 promoter sequence 
(5'-TAATACGACTCACTATAGGGTACT-3') at each end. The PCR products used in 
in vitro transcription reactions were amplified from 0-4 h embryonic cDNA using 
the following primer pairs: bazooka, 5'-GACGTTTTCTTGCTAAGCGG-3', 
5'-TTTCGCAGTGTAGGTCCAAA-3'; bazooka 5'UTR (for knockdown of 
endogenous but not transgenic bazooka), 5'-AATGCGCGCGTGTATGAATCACAC-3', 
5'-ACGACCGCATCATCATCATCGTCG-3'; par-1, 5'-CACGTTCTGCGGTAGCC-3', 
5'-GCTTGGGATCGGCTAAATC-3'. Double-stranded RNAs 
were injected into the embryos during the syncytial blastoderm stage, typically 
3-4 h before imaging. 

Live imaging, immunofluorescence and scanning electron microscopy. Two-photon 
live embryo imaging was performed on a custom-made system built on an 
upright Olympus BX51 microscope that is equipped with a Ti:sapphire laser 
tunable from 720 to 960 nm (Coherent). Single-photon confocal imaging was 
performed on a Leica SP5 system. Immunofluorescence was performed on heat- 
methanol fixed embryos. Antibodies used were mouse monoclonal anti-Neurotactin 
(BP106, Developmental Studies Hybridoma Bank, 1:20), rabbit 
anti-Runt (1:1,000), rabbit anti-Armadillo (1:200), rabbit anti-Par-1 (ref. 22, 
1:500), rabbit anti-PKCζ C20 (1:1,000, Santa Cruz Biotechnology), and guinea 
pig anti-Bazooka (1:500) and were visualized by Alexa 488-, 568- and 647-conjugated 
secondary antibodies (Molecular Probes). Scanning electron microscopy 
was performed on a Hitachi TM-1000 system as previously described. 
Images were processed, assembled into figures and converted into movies using 
ImageJ, Adobe Photoshop and Adobe Illustrator. 

Three-dimensional image processing and cell boundary reconstruction. The 
algorithm for three-dimensional reconstruction and analysis was implemented in 
C++ using the Qt library and OpenGL for the graphical user interface. Libtiff was 
used for loading image stacks. Image stacks of three-channel immunofluorescence 
were used. All channels of the 8-bit image volume were initially scaled to a 1:1:1
aspect ratio (voxel size, 0.16 µm × 0.16 µm × 0.16 µm) and down-sampled to 80%
(voxel size, 0.20 µm × 0.20 µm × 0.20 µm) to reduce image noise. The contrast of
Neurotactin immunofluorescence was enhanced by adaptive histogram adjust- 
ment using eight equally spaced histograms along the z-dimension and the 10th 
percentile as the minimum intensity value for intensity adjustment. Two passes of 
rank filtering were used to fill weak regions of Neurotactin staining, closing holes 
between cells: a rank filter using the 95th percentile intensity value in a sphere with 
radius 0.6 µm, followed by a rank filter using the 10th percentile intensity value in a
sphere with radius 0.8 µm. Edges were found using a difference of Gaussians
(DOG) approximation to a three-dimensional Marr-Hildreth edge detector where
the zero-crossing was positioned at a low threshold of 4 and a high threshold of 30 to
generate two binary image volumes. A “rolling ball” algorithm applied to the
high-threshold volume was used to repair holes in the epithelium due to ongoing
cellularization. The algorithm was computed efficiently using boundaries in a
Euclidean distance transform (EDT). Briefly, a boundary at a distance of 3 µm
was defined using the EDT. A second boundary, also at 3 µm from the first
boundary, was used to approximate the result of rolling a sphere on the high-
threshold binary image generated from the Marr-Hildreth operator. The repaired
boundary was applied to the low-threshold binary image to obtain a binary image 
where the outer boundary of the epithelium was repaired. This binary image was 
then thinned by three-dimensional surface thinning. Connected components in
the surface-thinned binary image were found by depth-first search. Components of
fewer than 100 voxels were removed as noise. Cells were found in this image by
hierarchical application of a seeded Watershed algorithm. Seed regions were
defined hierarchically by gradually applying an increasing threshold to an EDT of
the thinned binary image until a Watershed-segmented region reached a volume of
less than 640 µm³. Regions smaller than 40 µm³ were removed as noise. The
segmented cell regions were then converted into three-dimensional triangle
meshes by the Marching Cubes algorithm. Lastly, the resulting meshes were
adapted to the intensity of the image by a finite difference approximation to an
Active Surface.
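
As an illustration of the noise-removal step described above, the following minimal Python sketch finds connected components in a binary voxel set by iterative depth-first search and discards components of fewer than 100 voxels. It is not the C++ implementation used in the study: the function name, the set-of-coordinates representation and the use of 26-connectivity are assumptions made for the example.

```python
def remove_small_components(voxels, min_size=100):
    """Drop connected components smaller than min_size voxels.

    voxels: iterable of (x, y, z) integer coordinates of foreground voxels.
    Connectivity is 26-neighbour (an assumption; the text does not specify it).
    """
    # All 26 neighbour offsets around a voxel.
    offsets = [
        (dx, dy, dz)
        for dx in (-1, 0, 1)
        for dy in (-1, 0, 1)
        for dz in (-1, 0, 1)
        if (dx, dy, dz) != (0, 0, 0)
    ]
    remaining = set(voxels)
    kept = set()
    while remaining:
        seed = remaining.pop()
        stack = [seed]          # explicit stack: depth-first search
        component = [seed]
        while stack:
            x, y, z = stack.pop()
            for dx, dy, dz in offsets:
                neighbour = (x + dx, y + dy, z + dz)
                if neighbour in remaining:
                    remaining.remove(neighbour)
                    stack.append(neighbour)
                    component.append(neighbour)
        # Components of fewer than min_size voxels are treated as noise.
        if len(component) >= min_size:
            kept.update(component)
    return kept
```

An explicit stack is used instead of recursion so that large components do not exceed the Python recursion limit.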

Image quantification. The frequency of myosin bursts (Supplementary Fig. 3b) 
was measured in confocal time-lapse images of myosin-GFP and membrane-
mCherry. These data sets have a z resolution of 1 µm and cover a 10 µm distance
from the apical cortex with a temporal resolution ranging between 15 and 22 s per
frame. The intense myosin structures were visually identified from each z slice of 
the image stacks. The total number of myosin bursts was the sum of myosin bursts 
from all z slices throughout the duration of imaging. The frequency was then 
calculated by dividing the total number of bursts by the imaging duration and 
the number of cells in which the bursts were counted. The imaging duration ranges 
between 436 and 689 s. The numbers of initiating and neighbouring cells that were
counted range from 30 to 41 and from 51 to 86, respectively.
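
The frequency calculation above can be sketched in a few lines of Python. The function and argument names are hypothetical, and the example numbers are invented purely to show the arithmetic; they are not data from the study.

```python
def burst_frequency(bursts_per_slice, duration_s, n_cells):
    """Myosin bursts per cell per second.

    bursts_per_slice: burst counts for each z slice, already summed over time.
    duration_s: imaging duration in seconds.
    n_cells: number of cells in which bursts were counted.
    """
    total_bursts = sum(bursts_per_slice)
    return total_bursts / (duration_s * n_cells)


# Illustrative values only: 10 bursts over 500 s in 40 cells.
example = burst_frequency([3, 5, 2], duration_s=500.0, n_cells=40)
```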

Time-course analyses of junctional positioning and apical domain size 
(Supplementary Figs 4, 6) were performed using two-photon time-lapse images
of E-Cadherin-GFP and Resille-GFP. The onset of gastrulation was defined by the 
onset of anterior cell movement driven by the posterior midgut invagination. For 
the analysis of junctional positioning, the central initiating cell of the anterior and 
posterior folds and a third cell that resides in the region between the anterior and 
posterior fold that shows minimal junctional movement were chosen to represent 
the initiating and neighbouring cells. The vertical distance between the visually 
defined centre of the junctional complex and the apex of the cell was measured in 
ImageJ to represent the positioning of the junctions. For the analysis of apical
domain size, the central initiating cell of the posterior fold and a representative 
neighbouring cell in the region between the anterior and posterior folds were 
chosen. The apical domain above the junctions was manually outlined based on 
the membrane fluorescence of Resille-GFP and measured for its area and perimeter 
in ImageJ.

Correlation analysis between differential junctional displacement and lateral 
membrane curvature (Supplementary Fig. 5) was performed using two-photon 
time-lapse images of E-Cadherin-GFP. Cells that are in the immediate flanking 
regions of the initiating cells and show a marked asymmetry of junctional 
positioning on the opposite sides were chosen for these measurements. The 
differential junctional displacement, which defines the extent of junctional 
asymmetry, was calculated by subtracting the length of the apical domain on 
the distal side (y) from that on the proximal side (x). The lateral membrane 
curvature was defined as the ratio between the height (h) and the chord (C) of 
the arc of lateral membrane on the distal side of the cell. 
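
The two quantities defined above reduce to a difference and a ratio. A minimal Python sketch follows; the function names are hypothetical, and x, y, h and C are assumed to be lengths measured in the same units.

```python
def differential_junctional_displacement(x_proximal, y_distal):
    """Apical domain length on the proximal side (x) minus that on the distal side (y)."""
    return x_proximal - y_distal


def lateral_membrane_curvature(height, chord):
    """Ratio of the height (h) to the chord (C) of the lateral membrane arc."""
    if chord <= 0:
        raise ValueError("chord length must be positive")
    return height / chord
```

A flat lateral membrane gives a curvature of zero under this definition, and the value grows as the arc bulges further from its chord.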

Three-dimensional cell shape measurements (Fig. 1j-l) were made in image 
stacks of late cellularization embryos that have been stained for Bazooka and 
Neurotactin and processed for three-dimensional reconstruction as described 
above. The position of Bazooka was defined by the ‘Bazooka junctional triangles’. 
Briefly, an average intensity of Bazooka was first assigned for the voxels that 
intersect with a three-dimensional triangle mesh in the reconstructed cell boundary. 
The Bazooka junctional triangles were then selected based on an intensity threshold 
of the 99th percentile of the Bazooka intensity histogram. For each of the centroids 
of the Bazooka junctional triangles, a three-dimensional principal component ana- 
lysis (PCA) was performed to determine the Bazooka mean position (a point on a 
plane) and the eigenvector corresponding to smallest eigenvalue (plane normal). 
These were then used to define the Bazooka junctional plane that subdivides the cell 
into the apical and basal domains. The geometric measurements were performed as 
follows: three-dimensional PCA was applied to all of the vertices of the triangle 
mesh and the long direction of the cell was defined using the eigenvector corres- 
ponding to the largest eigenvalue. The apical domain length was measured by first 
creating vectors between the cell centroid and each mesh vertex on the apical side of 
the Bazooka junctional plane. These vectors were then projected onto the long 
direction vector of the cell. The length of the longest projected vector was used as 
the apical domain length. The basal domain length was measured similarly, using 
triangle mesh vertices on the basal domain of the cell. The total cell length was 
computed by a sum of the apical and basal length. The apical volume was computed 
by voxelizing the three-dimensional triangle meshes, and summing the volumes of 
voxels apical to the Bazooka junctional plane. Similarly, the apical surface area was 
computed by summing the areas of the mesh triangles for which the triangle 
centroid falls on the apical side of the Bazooka junctional plane. The initiating cells 
were selected on the basis of their location and junctional positioning. 
Approximately 500 dorsal cells in the region between the first and the seventh stripe 
of Runt were ranked by the apical domain length and the top 150 cells were selected 
for further analysis. A second selection was performed to isolate those that are in 
close proximity to the second and fifth stripes of Runt. Of these cells, those whose 
apical domain length was above the average were used for analysis. For the early 
stage embryos that showed no junctional shift, only the location-based selection was 
made. Cells that reside in the region between the anterior and posterior folds with 
junctional positioning that was below average were used as the neighbouring cells. 
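
The apical and basal domain length measurement described above can be illustrated with a short Python sketch. It follows the text literally (vectors from the cell centroid to mesh vertices on one side of the Bazooka junctional plane, projected onto the long axis, with the longest projection taken as the length). The function names are hypothetical, the long axis and plane are taken as given rather than computed by PCA, and the plane normal is assumed to point towards the apical side.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def domain_length(vertices, centroid, long_axis, plane_point, plane_normal,
                  apical=True):
    """Longest projection onto the long axis among vertices on one side of a plane.

    apical=True uses vertices on the side the plane normal points to
    (assumed apical); apical=False uses the opposite (basal) side.
    """
    norm = math.sqrt(_dot(long_axis, long_axis))
    axis = tuple(c / norm for c in long_axis)  # unit long-direction vector
    longest = 0.0
    for v in vertices:
        side = _dot(_sub(v, plane_point), plane_normal)
        on_requested_side = side > 0 if apical else side < 0
        if on_requested_side:
            projected = abs(_dot(_sub(v, centroid), axis))
            longest = max(longest, projected)
    return longest


def total_cell_length(vertices, centroid, long_axis, plane_point, plane_normal):
    """Sum of the apical and basal lengths, as described in the text."""
    args = (vertices, centroid, long_axis, plane_point, plane_normal)
    return domain_length(*args, apical=True) + domain_length(*args, apical=False)
```

In practice the long axis and the plane normal would come from the eigenvectors of the respective principal component analyses; they are passed in directly here to keep the sketch free of external dependencies.
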

Time-course analysis of Par-1 dynamics (Fig. 2c) was performed using two-
photon time-lapse images of Par-1-GFP. The average fluorescent intensity of
Par-1-GFP in manually selected areas consisting of two anterior or four posterior 
initiating cells was measured and normalized by that in areas consisting of four 
neighbouring cells that reside in the regions between the anterior and posterior 
folds. The onset of gastrulation was defined by the onset of anterior cell movement 
driven by posterior midgut invagination. 

Bazooka and Par-1 immunofluorescence (Fig. 2d, e and Supplementary Fig. 11) 
was quantified in image stacks of fixed embryos that were triply labelled for 
Bazooka, Par-1 and Neurotactin and processed for three-dimensional reconstruc- 
tion as described above to define the Bazooka junctional triangles. Junctional 
intensity of Bazooka within a cell was measured within and normalized by the 
Bazooka junctional volume that was defined by the voxelization of Bazooka junc- 
tional triangles. The basal-lateral intensity of Par-1 in each cell was measured 
within and normalized by the volume within a two-voxel distance from the cell 
boundary basal to the Bazooka junctional plane. For Fig. 2d, e, the anterior and 
posterior fold-initiating cells were selected on the basis of location and above- 
average Bazooka positioning, whereas the neighbouring cells were selected from 
the cells that reside in the region between the anterior and posterior folds with 
below-average Bazooka positioning. For Supplementary Fig. 11, the wild-type and 
Bazooka overexpression embryos were fixed, stained and imaged in parallel under 
identical conditions. Cell selection was performed as in Fig. 2d, e. 

Time-course of cell shortening in the initiating cells (Fig. 4b) was analysed by 
measuring the total cell length of five and four initiating cells each from wild-type 
and aPKC mutant embryos in ImageJ. The measurements were normalized by cell 
length at time zero, which was defined by the onset of anterior cell movement 
driven by the posterior midgut invagination. 
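
The normalization used for the cell-shortening time course can be sketched as follows. The function name is hypothetical and the example values are invented to show the calculation; time zero is assumed to be the first entry unless another index is given.

```python
def normalize_to_time_zero(lengths, zero_index=0):
    """Express each cell length as a fraction of the length at time zero."""
    reference = lengths[zero_index]
    if reference <= 0:
        raise ValueError("length at time zero must be positive")
    return [value / reference for value in lengths]


# Illustrative values only: a cell shortening from 40 to 30 (arbitrary units).
relative_lengths = normalize_to_time_zero([40.0, 36.0, 30.0])
```
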

Hsp72 preserves muscle function and slows
progression of severe muscular dystrophy 


Stefan M. Gehrig, Chris van der Poel, Timothy A. Sayer, Jonathan D. Schertzer, Darren C. Henstridge, Jarrod E. Church,
Severine Lamon, Aaron P. Russell, Kay E. Davies, Mark A. Febbraio & Gordon S. Lynch


Duchenne muscular dystrophy (DMD) is a severe and progressive 
muscle wasting disorder caused by mutations in the dystrophin 
gene that result in the absence of the membrane-stabilizing protein 
dystrophin. Dystrophin-deficient muscle fibres are fragile and
susceptible to an influx of Ca²⁺, which activates inflammatory and
muscle degenerative pathways. At present there is no cure for
DMD, and existing therapies are ineffective. Here we show that 
increasing the expression of intramuscular heat shock protein 72 
(Hsp72) preserves muscle strength and ameliorates the dystrophic 
pathology in two mouse models of muscular dystrophy. Treatment 
with BGP-15 (a pharmacological inducer of Hsp72 currently in 
clinical trials for diabetes) improved muscle architecture, strength 
and contractile function in severely affected diaphragm muscles in 
mdx dystrophic mice. In dko mice, a phenocopy of DMD that results
in severe spinal curvature (kyphosis), muscle weakness and
premature death, BGP-15 decreased kyphosis, improved the
dystrophic pathophysiology in limb and diaphragm muscles and
extended lifespan. We found that the sarcoplasmic/endoplasmic
reticulum Ca²⁺-ATPase (SERCA, the main protein responsible for
the removal of intracellular Ca²⁺) is dysfunctional in severely
affected muscles of mdx and dko mice, and that Hsp72 interacts with 
SERCA to preserve its function under conditions of stress, ultimately 
contributing to the decreased muscle degeneration seen with Hsp72 
upregulation. Treatment with BGP-15 similarly increased SERCA 
activity in dystrophic skeletal muscles. Our results provide evidence 
that increasing the expression of Hsp72 in muscle (through the 
administration of BGP-15) has significant therapeutic potential 
for DMD and related conditions, either as a self-contained therapy 
or as an adjuvant with other potential treatments, including gene, 
cell and pharmacological therapies. 

DMD is the most severe form of muscular dystrophy; it affects about 1 
in 3,500 live male births. Intracellular Ca²⁺ regulation is compromised
in dystrophic muscle fibres, which triggers chronic inflammation,
repeated cycles of degeneration with progressively ineffective regenera-
tion, and infiltration of fibrotic and other non-contractile material.
Mechanisms for the influx of Ca²⁺ into dystrophic muscle fibres include
membrane tears, stretch-activated channels, Ca²⁺ leak channels
and leaky Ca²⁺ release channels, and it has been speculated that the
function of SERCA, the main protein responsible for Ca²⁺ reuptake into
the sarcoplasmic reticulum (SR), is compromised. Increasing
SERCA pump expression within dystrophic muscles in transgenic mice
or through viral-mediated delivery improves Ca²⁺ handling and sup-
presses the pathological cascade of events. The role of inflammation
in the dystrophic pathology is well known, particularly that of the pro-
inflammatory cytokine tumour necrosis factor-α (TNF-α). TNF-α
activates the nuclear factor-κB (NF-κB) and c-Jun N-terminal kinase
(JNK) signalling pathways. Hsp72 is a molecular chaperone protein
that inhibits inflammatory mediators including p-JNK, TNF-α and the
NF-κB pathway, and binds and preserves SERCA function under
conditions of cellular stress. Although some studies have shown
Hsp72 to be elevated in patients with DMD, there is little consensus, 
because expression data for young patients are variable and sourcing 
age-matched controls is problematic (Supplementary Fig. 1a). 
Nevertheless, the endogenous heat shock protein response in DMD 
is insufficient to be protective. Here we tested the hypothesis that 
increasing the levels of Hsp72 protects dystrophic muscles from func- 
tional deterioration. We bred dystrophin-null mdx dystrophic mice 
with mice showing a muscle-specific transgenic (TG) overexpression 
of Hsp72, producing mdx^TG(+) mice and mdx^TG(−) littermate controls
(Fig. 1a; see Supplementary Fig. 1b for quantification). At about 25
weeks of age, serum levels of creatine kinase (CK), a classic indicator of
muscle breakdown, were decreased in mdx^TG(+) mice compared with
littermate control mice lacking the transgene (mdx^TG(−); Fig. 1b).
Because most patients with DMD show severe weakness and/or
muscle fatigue, we assessed whole-body strength and endurance in
dystrophic mice by using a hang test to measure latency-to-fall,
which was significantly improved in mdx^TG(+) mice (24 ± 3 s versus
64 ± 14 s; P = 0.002, n = 20 mice). Respiratory failure is the cause of
death in up to 90% of patients with DMD, and because diaphragm
function is an accurate predictor of respiratory insufficiency we inves-
tigated the effect of Hsp72 overexpression on the pathophysiology of
the diaphragm in dystrophin-deficient mice. The progressive degen-
eration of the diaphragm in mdx mice closely mimics that in DMD.
Gross histological analyses revealed that the diaphragm pathology
in mdx^TG(+) mice was ameliorated compared with age-matched
littermate control mdx^TG(−) mice (Supplementary Fig. 1c), an obser-
vation supported by the minimal Feret’s diameter variance coefficient,
which provides a sensitive measure of fibre heterogeneity and the
dystrophic pathology. We found a lower Feret’s diameter variance
coefficient in mdx^TG(+) mice than in mdx^TG(−) mice, indicative of an
improved phenotype (405 ± 5 versus 358 ± 5; P < 0.001, n = 5).
Damaged myofibres can be revealed by the infiltration of Evans blue
dye (EBD) entering the myoplasm through tears in the sarcolemma.


(Fig. 1c and Supplementary Fig. 1d), indicative of decreased necrosis 
and hastened overall repair of damaged fibres rather than improved 
structural integrity. To support this contention, we performed well- 
described in situ and in vitro contraction-induced injury protocols on 
tibialis anterior (TA) and diaphragm muscles, respectively (see 
Methods). No differences in contraction-mediated damage were 
evident between muscles of mdx^TG(+) and mdx^TG(−) mice (Sup-
plementary Fig. 1e), indicating that structural integrity was unaltered.
Expression of the dystrophin homologue, utrophin, a protein known to
compensate for the loss of dystrophin, was also unchanged (Sup-
plementary Fig. 1f). We assessed collagen infiltration in sections of
diaphragm and found that mdx^TG(+) mice had less collagen infiltration
than mdx^TG(−) mice at both 25 weeks of age (Fig. 1d and Supplemen-
tary Fig. 1g) and 80 weeks of age (data not shown). Normalized force
production was significantly higher in diaphragm muscle strips from
mdx^TG(+) mice than in those from mdx^TG(−) mice (Fig. 1e).

Figure 1 | Transgenic Hsp72 overexpression increases muscle strength,
decreases muscle breakdown and improves diaphragm muscle histological
parameters in mdx mice. a, Representative western blot detection of Hsp72 in
diaphragm muscle homogenates from non-dystrophic (WT) and dystrophic
(mdx) Hsp72 transgene-negative (TG−) and transgene-positive (TG+) mice.
GAPDH, glyceraldehyde-3-phosphate dehydrogenase. b, Whole-body muscle
breakdown, measured by serum creatine kinase (CK) levels. Asterisk, P < 0.05,
n = 6. c, Quantified mean data for EBD-positive area in diaphragm muscle
sections. Asterisk, P < 0.05, n = 4. d, Representative images of collagen
infiltration in diaphragm sections (revealed with Van Gieson’s stain) from
mdx^TG(−) and mdx^TG(+) mice. e, Specific (normalized) force of diaphragm
muscle strips measured in vitro. Asterisk, P < 0.05, n = 5. All data are from 25-
week-old mice. Scale bar, 200 µm. Data are shown as means ± s.e.m.

We have shown that Hsp72 can block inflammation in mice 
in vivo, and because inflammation promotes muscle degeneration
in mouse models of DMD we examined the effect of Hsp72 on
expression of p-JNK, p-IKK (a key mediator of NF-κB activation)
and TNF-α. Western blot and polymerase chain reaction (PCR)
analyses revealed no difference in these inflammatory markers in dia-
phragm muscles from mdx^TG(+) and mdx^TG(−) mice (Supplementary
Fig. 2, and data not shown). There was significantly decreased
messenger RNA expression of macrophage markers CD68 and F4/80
and TNF-α in TA muscles of mdx^TG(+) mice (Supplementary
Fig. 3), but these did not translate to functional improvements (data 
not shown). We next examined SERCA activity in diaphragm muscles 
from wild-type (WT) and mdx mice and showed a progressive age- 
related decline in maximal SERCA activity in mdx mice (Fig. 2a and 
Supplementary Fig. 4a), despite an age-related increase in SERCA 
protein expression (Supplementary Fig. 4b). This functional decline 
is attributed, in part, to post-translational modifications of the SERCA 
protein, especially nitrosylation, which decreases maximal SERCA 
activity as a result of changes in Ca²⁺-binding and ATP-binding
domains. Alterations in the Ca²⁺-binding domain change SERCA
enzyme kinetics, decreasing Ca²⁺ sensitivity and increasing [Ca²⁺]₅₀
(the [Ca²⁺] required to achieve half-maximal enzyme activity), an
effect we observed in diaphragm muscles from 30-week-old mdx mice
(Supplementary Fig. 4c). Similar deficits in SERCA activity were evid-
ent in both limb and diaphragm muscles of severely affected dko mice
(Fig. 2b), indicating that abnormal SERCA function may contribute to
the disruptions in Ca²⁺ regulation characteristic of dystrophic
muscles. Indeed, recent evidence suggests that closer regulation of
Ca²⁺ homeostasis through enhanced SERCA expression or activity
significantly suppresses the degeneration of dystrophic muscle.
Because Hsp72 binds SERCA and prevents functional inactivation
under conditions of cellular stress, we next tested the hypothesis that
overexpression of Hsp72 in dystrophic muscles would improve SERCA
function. We examined maximal SERCA activity in homogenates of
quadriceps muscles, and found an increase in activity in mdx^TG(+)
compared with mdx^TG(−) mice (Fig. 2c). To support this finding we
also examined Ca²⁺ accumulation in SR (an indirect measure of
SERCA activity) in single muscle fibres dissected from fast-twitch
extensor digitorum longus, and predominantly slow-twitch soleus
muscles, and found a similar increase in maximal SERCA activity in
individual muscle fibres in mdx^TG(+) compared with mdx^TG(−) mice
(as for whole-muscle homogenates), with no changes in mRNA for
SERCA or in protein expression (Fig. 2d, Supplementary Fig. 5 and
Supplementary Table 1). We then examined whether Hsp72 over-
expression could protect SERCA function under conditions of
stress, like that induced by reactive oxygen-nitrogen species such as
peroxynitrite (ONOO⁻). We developed a SERCA activity assay in
which enriched SR vesicles isolated from mdx^TG(−) and mdx^TG(+) mice
were incubated with increasing concentrations of ONOO⁻; the sub-
sequent suppression in activity was normalized to that in the absence of
ONOO⁻. We found that SERCA activity was greater in the presence of
various ONOO⁻ concentrations in enriched SR vesicles from
mdx^TG(+) compared with mdx^TG(−) mice (Fig. 2e), with western blots
revealing Hsp72 protein levels were highly elevated in enriched SR
vesicles from mdx^TG(+) mice (Fig. 2f; see Supplementary Fig. 6 for full
blots), indicating that Hsp72 was bound within the SR to mediate this
protective effect. These data indicate that Hsp72 overexpression in
dystrophic muscles can protect SERCA from maces modifica-
tions and is a likely mechanism of protection in mdx^TG(+) mice.

Figure 2 | Maximal SERCA activity is decreased in mouse models of
dystrophy; Hsp72 binding improves SERCA function. a, Maximal SERCA
activity in diaphragm muscle homogenates from WT and mdx mice at 3, 10 and
30 weeks of age. Asterisk, P < 0.001, n = 4. b, Maximal SERCA activity of
muscle homogenates from quadriceps and diaphragm from 10-week-old WT
and dko mice. Asterisk, P = 0.036. Two asterisks, P < 0.001, n = 4. c, Maximal
SERCA activity in muscle homogenates from mdx^TG(−) (pale grey bars) and
mdx^TG(+) (dark grey bars) mice. Asterisk, P < 0.008, n = 9. d, Ca²⁺
accumulation curves (an indirect measure of SERCA activity) for SR in single
fibres from extensor digitorum longus muscles of mdx^TG(−) and mdx^TG(+)
mice. Asterisk, P < 0.01, n = 15. e, Maximal SERCA activity after incubation
for 5 min at various concentrations of peroxynitrite (ONOO⁻), measured in
enriched SR vesicles isolated from muscles of mdx^TG(−) and mdx^TG(+) mice.
SERCA activity was normalized to maximal activity in the absence of ONOO⁻.
Asterisk, P < 0.05, n = 3. f, Representative western blots of enriched SR vesicles
from mdx^TG(−) and mdx^TG(+) (n = 4) mice, showing Hsp72 protein expression
and Coomassie blue stain (showing SERCA isoforms). Data are shown as
means ± s.e.m.

Given the significant phenotypic improvements in mdx^TG(+) mice,
especially in the diaphragm, we examined whether similar effects could 
be achieved through the pharmacological or heat-therapy induction of 
Hsp72. BGP-15 is a pharmacological inducer of Hsp72 that can protect 
against obesity-induced insulin resistance and is in Food and Drug
Administration (FDA)-approved Phase II clinical trials for diabetes.
Dystrophic mdx mice were treated from 4 to 9 weeks of age (5 weeks)
or from 4 to 16 weeks of age (12 weeks) with BGP-15 (15 mg kg⁻¹ per
day, oral gavage). WT mice were used for comparisons with mdx mice. 
Hsp72 protein expression was elevated in diaphragm muscle homo- 
genates from BGP-15-treated mdx mice (and heat-therapy-treated 
mice) compared with untreated mdx mice (Fig. 3a; see Supplemen- 
tary Fig. 7a for quantification). CK levels were decreased in BGP-15- 
treated mdx mice compared with untreated mdx mice (Fig. 3b). EBD 
infiltration was also reduced with long-term BGP-15 treatment 
(Fig. 3c). Strength and endurance were evaluated with the inverted hang
test; WT mice were stronger than mdx mice, and BGP-15-treated mdx
mice showed an increased latency-to-fall compared with control
(Supplementary Fig. 7). An increase in fibrosis, as seen in DMD,
was observed in the mdx diaphragm (compared with WT), and 
BGP-15 treatment significantly decreased fibrosis (Fig. 3d; see 
Supplementary Fig. 7 for quantification). Treatment with BGP-15 
attenuated the functional deterioration of the diaphragm muscle sig- 
nificantly (Fig. 3e). Maximal SERCA activity in diaphragm homoge- 
nates was increased in mdx mice after long-term treatment with 
BGP-15, indicating a mechanism consistent with that of transgenic 
Hsp72 overexpression (Fig. 3f). Because elevated core temperature is 
a potent inducer of heat shock proteins, we also extensively tested
this method of heat shock protein induction (see Methods). Similar 
beneficial effects to those observed with transgenic Hsp72 overexpres- 
sion and BGP-15 treatment were seen in mdx mice exposed to repeated 
heat therapy (see Supplementary Fig. 8). 

We then investigated whether treatment with BGP-15 was protective 
in severely affected dystrophic dko mice, the most phenotypically 
accurate murine model of DMD (see Methods). The dko mice were
treated with BGP-15 from 3-4 weeks until 10 weeks of age. For the 
survival study, dko mice were treated from 3-4 weeks until death or 
humane killing (as described in Methods). Photographs of control and 
BGP-15-treated dko mice were taken at 10 weeks, immediately before 
killing, and after evisceration and staining of the skeleton with alizarin 
red (Fig. 4a). Data from WT mice (as in Fig. 3) were also used for 
comparisons with dko mice. Because boys with DMD have significant 
paraspinal muscle weakness and in many cases severe kyphosis (spinal 
curvature), this was quantified in conscious mice (from the 10-week
endpoint cohort) by a blinded investigator using a 1-5 index of
kyphosis, 1 indicating no spinal deformity on palpation and 5 being
the most severe. Treatment with BGP-15 decreased kyphosis markedly
compared with vehicle-treated controls (Fig. 4b). Serum CK levels were
significantly lower in dko mice after treatment with BGP-15 (Fig. 4c),
indicating decreased whole-body muscle breakdown. Collagen
infiltration was decreased in the diaphragm of dko mice after treatment
with BGP-15 (Fig. 4d) and the force-producing capacity of diaphragm
muscle strips and intact TA muscles (measured in situ) was increased
significantly in BGP-15-treated dko mice, with maximum force
restored to WT levels in the TA muscle (Fig. 4e, f and Supplemen-
tary Table 2). No differences in body mass or in calcification or central
nucleation within diaphragm muscles were evident in dko mice after
treatment with BGP-15 (data not shown). However, the most import-
ant outcome was that lifelong treatment of dko mice with BGP-15
significantly extended survival (Fig. 4g, h; P < 0.05; 27% increase in
median lifespan). This finding has clinical relevance for DMD.

Figure 3 | Pharmacological induction of Hsp72 ameliorates muscular
dystrophy in mdx mice. a, Representative western blot detection of Hsp72 in
diaphragm muscle homogenates of control (Con.) and BGP-15-treated mdx
mice. b, Whole-body muscle breakdown, measured by serum CK levels in WT
(C57BL/10), mdx (control) and BGP-15-treated mdx mice. Asterisk, P < 0.01,
n = 8. c, Quantified mean data for EBD infiltration in mdx mice treated for 12
weeks. Asterisk, P < 0.05, n = 8. d, Representative images of collagen
infiltration in diaphragm muscle sections. e, Specific force production in
diaphragm muscle strips measured in vitro. Asterisk, P < 0.01, n = 9.
f, Maximal SERCA activity in diaphragm muscle homogenates from mdx mice
treated for 12 weeks. Asterisk, P < 0.05, n = 6. Scale bar, 200 µm. WT data are
used as a reference control. All treated mice received BGP-15 for 5 weeks unless
stated otherwise. Data are shown as means ± s.e.m.

Our findings reveal that transgenic Hsp72 overexpression improves 
several pathological indices in mdx dystrophic mice, at least in part by 
preserving or improving SERCA function. Furthermore, treatment of 
dystrophic mice with BGP-15, a known pharmacological co-inducer of 
Hsp72, ameliorated the dystrophic pathology and extended the 
lifespan in dko mice. Taken together, these results indicate that induc- 
tion of Hsp72 in muscular dystrophy is an important and novel 
therapeutic approach that can improve the dystrophic pathology 
and attenuate the disease progression. Although an ultimate cure for 
DMD is likely to be derived from gene or cell therapies, considerable
obstacles need to be overcome before these approaches can be con-
sidered safe and effective. Until these concerns are obviated, alterna-
tive (and potentially synergistic) therapies, such as pharmacological
induction of Hsp72, could delay the disease progression to allow many
patients to benefit from perfected treatments.

Figure 4 | Treatment with BGP-15 decreases kyphosis (spinal curvature),
improves muscle function and prolongs lifespan in severely dystrophic dko
mice. a, Left: representative photographs of WT (C57BL/10), dko and BGP-15-
treated dko mice. Right: representative eviscerated skeletal preparations showing
bone structure (pink), and highlighting spinal curvature in dko mice. b, Spinal
curvature was quantified (1-5) in WT, dko and BGP-15-treated dko mice.
Asterisk, P < 0.05, n = 10. c, Whole-body muscle breakdown, measured by
serum CK levels. Asterisk, P < 0.05, n = 8. d, Representative images of collagen
infiltration in diaphragm sections. e, Specific force of diaphragm muscle strips
measured in vitro. Asterisk, P < 0.01, n = 9. f, Maximal force production in TA
measured in situ. Asterisk, P < 0.01, n = 9. g, Survival curve of untreated
(control) and BGP-15-treated dko mice. P < 0.05, n = 14. h, Scatter-plot of dko
lifespan with a line showing median survival. Scale bar, 200 µm. WT data are
used as a reference control. Data are shown as means ± s.e.m.

In a recent clinical trial for patients with insulin resistance, treat- 
ment with BGP-15 (200 mg and 400 mg once a day for 28 days) sig- 
nificantly increased sensitivity to insulin. The dose schedule of BGP-15 
was well tolerated, with both doses being safe: there were no clinically 
significant changes in physical status or in laboratory or electro-
cardiogram parameters. Given that BGP-15 is currently used in
clinical trials for other pathologies, our findings identify it as a tangible 
and realistic treatment method for patients with DMD in the near 
future. 


METHODS SUMMARY 


Animals. All experiments were approved by the Animal Ethics Committee of The 
University of Melbourne and conducted in accordance with the Australian code of 
practice for the care and use of animals for scientific purposes as stipulated by the 
National Health and Medical Research Council (NHMRC, Australia). Male mice 
were used for all experiments. Wild-type (C57BL/10) and dystrophic mdx mice 
were sourced from the Animal Resources Centre (Canning Vale, Western 
Australia). 

Muscle functional analysis. Mice were anaesthetized with sodium pentobarbitone 
(Nembutal) such that they were unresponsive to tactile stimuli. Contractile 
properties of diaphragm muscle strips were assessed in vitro. 

Morphological analysis. Muscles were trimmed of tendons and adhering non- 
muscle tissue, mounted in embedding medium, frozen in liquid-nitrogen-cooled 
isopentane, and stored at −80 °C. Transverse muscle sections were cryosectioned 
from the mid-belly of each muscle. Muscle collagen content was assessed from 
Van Gieson’s stained cross-sections that were quantified. 

SERCA activity assay. Ca²⁺-dependent SERCA activity was assessed in isolated 
enriched SR vesicles and whole-muscle homogenates. For whole-muscle 
homogenates, muscles were surgically excised from anaesthetized mice and stored at 
−80 °C for subsequent analyses. For enriched SR vesicles, mixed hindlimb 
muscles (quadriceps, gastrocnemius, extensor digitorum longus, soleus and plantaris) 
and diaphragm muscles were homogenized and subjected to sucrose gradient 
differential centrifugation using a Thermo Scientific Sorvall WX100 ultracentri- 
fuge with a T-890 fixed-angle rotor. During the entire homogenization and SR 
vesicle isolation procedures, samples were immersed in ice to avoid temperature- 
dependent decreases in SERCA activity. 

Skeletal preparation. To reveal skeletal architecture, mice (after death) were 
skinned, eviscerated and placed in a KOH solution (1.5% w/v) for 5 days. KOH 
solution was replaced and a small amount of alizarin red was added to stain 
calcium deposits; the preparation was left for a further 5 days. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


Animals. Female mdx mice were crossed with male mice expressing a rat inducible 
Hsp72 transgene under the control of a β-actin promoter. F1 generation males 
were mated with female mdx mice to produce an equal proportion of mdxᵀᴳ⁽⁺⁾ and 
mdxᵀᴳ⁽⁻⁾ littermate controls. WTᵀᴳ⁽⁺⁾ and WTᵀᴳ⁽⁻⁾ mice were generated as 
described previously. Genotyping was performed by PCR using primers 
described previously. C57BL/10 and mdx mice used for heat therapy and 
BGP-15 treatment studies were obtained from the Animal Resources Centre 
(Canning Vale, Western Australia). Dystrophic mdx mice were treated from 
4 weeks to 9 weeks of age (5 weeks) or from 4 weeks to 16 weeks of age (12 weeks) 
with BGP-15 (15 mg kg⁻¹ per day, oral gavage; N-Gene R&D Inc.). BGP-15 is a 
pharmacological inducer of Hsp72, which has been shown to be safe and well 
tolerated in FDA-approved Phase II clinical trials for diabetes and insulin 
resistance. For heat therapy, the mice were anaesthetized and the core body 
temperature was raised to about 40 °C with an infrared thermometer for 30 min every 
fourth day; body temperature was monitored from the external auditory meatus as 
described previously. The dko mice were bred in the Biological Research Facility 
at the University of Melbourne. The dko mice are utrophin-null on an mdx 
background and show a severe and progressive muscular dystrophy that more 
closely mimics the phenotypic characteristics of DMD. The dko mice were 
treated with BGP-15 from 3-4 weeks to 10 weeks of age. For the survival study, 
dko mice were treated from 3-4 weeks until death or humane killing in accordance 
with the Animal Ethics Committee of The University of Melbourne (AEC). All 
experiments were approved by the AEC and conducted in accordance with the 
Australian code of practice for the care and use of animals for scientific purposes as 
stipulated by the National Health and Medical Research Council (Australia). Male 
mice were used for all experiments. 

Muscle functional analyses. Mice were anaesthetized with sodium pentobarbitone 
(Nembutal; 60 mg kg⁻¹ intraperitoneal; Sigma-Aldrich, New South Wales, 
Australia) such that they were unresponsive to tactile stimuli. Contractile properties 
of diaphragm muscle strips were assessed in vitro as described in detail previously. 
In brief, strips of diaphragm muscle were bathed in oxygenated Krebs solution at 
25 °C in a custom-made organ bath. Preparations were stimulated by supramaximal 
0.2-ms square-wave pulses of 450 ms duration delivered by means of platinum plate 
electrodes flanking both sides of the muscle. Contractile properties of the TA muscle 
were assessed in situ. In brief, TA muscles maintained at 37 °C were stimulated by 
supramaximal (14 V) 0.2-ms square-wave pulses of 350 ms duration delivered by 
means of two wire electrodes resting on the peroneal nerve. All muscles were 
adjusted to optimum muscle length (Lo), determined from maximum isometric 
twitch force (Pt). Maximum isometric tetanic force (Po) was recorded from the 
plateau of the frequency-force relationship, and normalized to muscle 
cross-sectional area (specific force; sPo), for comparisons between groups where 
appropriate. The susceptibility of TA and diaphragm muscles to 
contraction-induced injury was determined from the protocols that we have described in detail 
previously. Isolated muscles were maximally activated to produce isometric 
force and then stretched to perform an eccentric contraction (at a velocity of 
2 Lf s⁻¹) at progressively increasing magnitudes of stretch. Maximum isometric 
force was determined before each eccentric contraction. 

Morphological analysis. Muscles were trimmed of tendons and adhering non- 
muscle tissue, mounted in embedding medium, frozen in liquid-nitrogen-cooled 
isopentane, and stored at −80 °C. Transverse muscle sections (5 µm) were 
cryosectioned from the mid-belly of each muscle. Sections were stained with 
haematoxylin/eosin to reveal general muscle architecture. Cross-sectional area 
(CSA) and minimal Feret’s diameter were assessed in about 500 fibres from each 
section of diaphragm muscle, with the use of Carl Zeiss software (Axiovision 
4.6.3). The minimal Feret’s diameter is the minimum distance between opposing 
parallel tangents of a muscle fibre, and the variance coefficient is a highly 
sensitive parameter used for detecting differences between dystrophic and 
otherwise healthy muscles. Muscle collagen content was assessed from Van 
Gieson’s stained cross-sections that were quantified with Axiovision 4.6.3 
software. 
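
As an illustration only, the variance coefficient of the minimal Feret’s diameter could be computed as sketched below. The formula is not stated in the text; the scaling by 1,000 and the use of the sample standard deviation are assumptions, and the function name is hypothetical.

```python
import statistics


def variance_coefficient(feret_diameters_um, scale=1000.0):
    """Spread of fibre sizes relative to their mean.

    Assumption: scale * (sample SD / mean); the scale of 1,000 is a
    convention adopted for this sketch, not taken from the text.
    """
    mean_diameter = statistics.mean(feret_diameters_um)
    sd_diameter = statistics.stdev(feret_diameters_um)
    return scale * sd_diameter / mean_diameter


# Hypothetical diameters (µm) for three fibres; a real section has about 500.
example = variance_coefficient([10.0, 20.0, 30.0])
```

A larger value indicates a more heterogeneous fibre size distribution within the section.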

Enriched SR isolation. Mixed hindlimb muscles (quadriceps, gastrocnemius, 
extensor digitorum longus, soleus and plantaris) and diaphragm muscles were 
diluted in ice-cold homogenization buffer (250 mM sucrose, 5 mM HEPES pH 7.0, 
0.2% NaN₃). Protease inhibitor cocktail (P-8340; Sigma-Aldrich, Castle Hill, New 
South Wales, Australia) was added immediately before homogenization at 5 µl per 
100 mg muscle wet weight. The muscles were minced on ice with surgical scissors 
and homogenized with a Polytron homogenizer (PT2100; Kinematica, Inc. 
Dispersing and Mixing Technology, New York) at a power setting of 21 for three 
20-s bursts separated by 45-s breaks on ice. To obtain a purified and enriched SR 
membrane fraction, the homogenates were centrifuged at 5,500g for 10 min at 4 °C 
to remove insoluble material. The supernatants were harvested and centrifuged at 
12,500g for 18 min at 4 °C. The pellet was discarded and the supernatant was 
centrifuged for a second time at 12,500g for 18 min at 4 °C. Supernatants were 
transferred to ultracentrifuge tubes, which were balanced and centrifuged at 
50,000g (24,200 r.p.m. on a T-890 fixed-angle rotor; Thermo Scientific Sorvall 
WX100 ultracentrifuge) for 1 h at 4 °C. Supernatants were discarded and pellets 
were resuspended in 5 ml of homogenization buffer containing 600 mM KCl and 
incubated on ice for 30 min. Samples were centrifuged at 15,000g for 10 min at 4 °C 
to pellet and remove mitochondria. Supernatants were centrifuged again at 
50,000g for 1 h at 4 °C. The final pellet (enriched SR membrane vesicles) was 
resuspended in homogenization buffer. Protein content was determined in 
triplicate. 
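
The pairing of 50,000g with 24,200 r.p.m. quoted above can be checked with the standard relation between relative centrifugal force and rotor speed. The sketch below is illustrative; the radius of 7.64 cm is an assumed value chosen to reproduce the quoted pairing, not a rotor specification.

```python
import math

# Standard conversion constant when the radius is given in centimetres.
RCF_CONSTANT = 1.118e-5


def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (multiples of g) at a given speed and radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (r.p.m.) needed to reach a given relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


ASSUMED_RADIUS_CM = 7.64  # hypothetical effective radius
force_at_24200 = rcf_from_rpm(24200, ASSUMED_RADIUS_CM)
speed_for_50000g = rpm_from_rcf(50000, ASSUMED_RADIUS_CM)
```

With a different rotor, the same force requires a different speed, so protocols are better transferred by g than by r.p.m.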

SERCA activity assay. Ca²⁺-dependent SERCA activity was assessed in isolated 
enriched SR vesicles and whole-muscle homogenates on the basis of the methods of 
Leberer and colleagues, as described previously. For whole-muscle homogenates, 
muscles were surgically excised from anaesthetized mice and stored at −80 °C. 
Muscle samples (about 20-50 mg) were diluted in about 200 µl of ice-cold 
homogenization buffer. Protease inhibitor cocktail (P-8340) was added immediately 
before use at a concentration of 5 µl per 100 mg of muscle tissue. Muscles were 
homogenized with a hand-held glass homogenizer, and then centrifuged for 10 min 
at 5,500g at 4 °C. Supernatant protein concentration was determined in triplicate. 
Protein concentration was adjusted to 10 mg ml⁻¹ when possible, with 
homogenization buffer. SERCA activity in whole-muscle homogenates was determined in 
reaction buffer (200 mM KCl, 20 mM HEPES, pH 7.0, 15 mM MgCl₂, 10 mM 
NaN₃, 10 mM phosphoenolpyruvate, 5 mM ATP, 1 mM EGTA). SERCA activity 
in enriched SR vesicles was determined in reaction buffer (100 mM KCl, 20 mM 
HEPES, 10 mM MgCl₂, 10 mM NaN₃, 10 mM phosphoenolpyruvate, 5 mM ATP 
and 1 mM EGTA). The pH of both reaction buffers was adjusted to 7.0 at 37 °C. 
Immediately before starting the reaction, 18 U ml⁻¹ PK, 18 U ml⁻¹ lactate 
dehydrogenase (LDH), 5 µl NADH (100 mM), 1 µM calcimycin A-23187 
(Sigma-Aldrich) and about 10 µl of whole-muscle homogenate or about 3 µl of 
enriched SR vesicles were added to 1 ml of reaction buffer in a plastic cuvette. 
Cuvettes were loaded into a spectrophotometer and A₃₄₀ was measured at 37 °C 
(Multiscan Spectrum; Thermo Electron, Waltham, Massachusetts). Maximal 
(Vmax) and Ca²⁺-dependent SERCA activities were determined by progressively 
adding Ca²⁺ until a plateau or maximal activity was reached. The specific SERCA 
inhibitor 2′,5′-di(tert-butyl)-1,4-benzohydroquinone (TBQ) was added to a final 
concentration of 40 µM to determine basal activity. SERCA enzyme kinetic 
parameters, determined from a regression analysis, were the Ca²⁺ concentration 
required to elicit 50% maximal SERCA activity ([Ca²⁺]₅₀) and the Hill 
coefficient, which is a measure of the cooperativity of Ca²⁺ binding of the SERCA 
enzyme. SERCA activity was graphed against added Ca²⁺ concentration. 
Nonlinear regression analysis was performed using the following variable-slope sigmoidal 
dose-response relationship, using Graphpad Prism v.3.02 (GraphPad Software 
Inc., San Diego): 
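
The equation itself is not reproduced in this text. As a sketch under that caveat, the widely used four-parameter form of a variable-slope sigmoidal dose-response curve is shown below, with activity expressed as a function of the logarithm of the Ca²⁺ concentration. The parameter names are illustrative, and fitting them to data would additionally require a nonlinear least-squares routine, which is not shown.

```python
def sigmoidal_variable_slope(log_ca, bottom, top, log_ca50, hill_slope):
    """Four-parameter sigmoidal dose-response curve.

    log_ca     : log10 of the Ca2+ concentration
    bottom/top : lower and upper plateaus of activity
    log_ca50   : log10 of the concentration giving half-maximal activity
    hill_slope : steepness of the curve (Hill coefficient)
    """
    return bottom + (top - bottom) / (1.0 + 10 ** ((log_ca50 - log_ca) * hill_slope))


# Hypothetical parameters: activity runs from 0 to 100% with a midpoint at pCa 6.
half_maximal = sigmoidal_variable_slope(-6.0, 0.0, 100.0, -6.0, 2.0)
```

At the midpoint the curve returns the mean of the two plateaus regardless of the slope, which is why [Ca²⁺]₅₀ and the Hill coefficient can be read as independent descriptors of position and steepness.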

Peroxynitrite-mediated SERCA inhibition assay. SERCA activity in enriched 
SR vesicles after incubation in various concentrations of peroxynitrite 
([ONOO⁻]) was measured. [ONOO⁻] was measured spectrophotometrically 
(ε = 1,670 M⁻¹ cm⁻¹ at 302 nm, about 68 mM in stock). Enriched SR vesicle samples 
were added to reaction buffer (as described previously) in the presence of ONOO⁻ 
(0, 10, 50, 100, 250, 500 or 1,000 µM) and incubated for 5 min at 37 °C in a plastic 
cuvette. ONOO⁻ and byproducts were quenched by the addition of 1 mM 
dithiothreitol (to prevent the inactivation of PK and LDH subsequently added); 
18 U ml⁻¹ PK, 18 U ml⁻¹ LDH and 5 µl of NADH (100 mM) were then added. 
Cuvettes were loaded into a spectrophotometer, and A₃₄₀ was measured at 37 °C. 
Maximal SERCA activity was determined by adding Ca²⁺ until a plateau or 
maximal activity was reached. Once maximal activity had been determined, the 
specific SERCA inhibitor TBQ was added to a final concentration of 40 µM to 
determine basal activity. Maximal SERCA activity in each sample was assessed 
independently at various ONOO⁻ concentrations (10, 50, 100, 250, 500 or 
1,000 µM) and expressed as a percentage of the maximal SERCA activity in the 
absence of ONOO⁻. 
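
The two calculations implied above, a Beer-Lambert estimate of the stock concentration and the expression of activity as a percentage of the untreated value, can be sketched as follows. The absorbance, dilution factor and path length in the example are hypothetical; only the extinction coefficient comes from the text.

```python
EPSILON_302 = 1670.0  # M^-1 cm^-1 at 302 nm, as quoted in the text


def peroxynitrite_molar(absorbance_302, dilution_factor=1.0, path_cm=1.0):
    """Stock concentration (M) from the absorbance of a diluted sample."""
    return dilution_factor * absorbance_302 / (EPSILON_302 * path_cm)


def percent_of_control(activity, control_activity):
    """Activity expressed as a percentage of the value without peroxynitrite."""
    return 100.0 * activity / control_activity


# Hypothetical reading: A302 = 0.568 after a 200-fold dilution, 1-cm cuvette.
stock_molar = peroxynitrite_molar(0.568, dilution_factor=200)
```

A stock in the tens of millimolar range has to be diluted before reading, because its undiluted absorbance would lie far outside the linear range of a spectrophotometer.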

Human DMD samples. Human muscle specimens were sourced from the 
Telethon Network of Genetic Biobanks, Italy. Biopsies were taken from the vastus 
lateralis muscle of patients with DMD (aged 1-9 years) or healthy controls (aged 
18-25 years) using either a Bergström or open biopsy technique. All biopsies were 
frozen immediately in liquid nitrogen and stored at −80 °C until analysis. Muscle 
samples were homogenized in RIPA buffer (Millipore, Billerica, Massachusetts) 
and rotated for 1 h at 4 °C, followed by centrifugation for 15 min at 4 °C. The 
supernatant was collected and protein concentration was determined by means of 
the bicinchoninic acid protein assay (Pierce Biotechnology). Electrophoresis was 
performed by 10% SDS-PAGE in a buffer containing 12 mM Tris-HCl pH 8.8, 
200 mM glycine and 0.1% SDS. Protein transfer was performed in a buffer 
containing 12 mM Tris-HCl pH 8.3, 200 mM glycine and 10% methanol with the use of 
poly(vinylidene difluoride) membranes. The membranes were blocked with 5% 
BSA in PBS, after which they were incubated overnight at 4 °C with the primary 
antibody against Hsp70 (SPA-812; Stressgen) diluted 1:1,000 in PBS. After being 
washed, the membranes were incubated for 1 h with a goat anti-rabbit IgG 
antibody labelled with an infrared-fluorescent 800-nm dye (Alexa Fluor 800; 
Invitrogen) diluted 1:5,000 in PBS containing 50% Odyssey blocking buffer 
(LI-COR Biosciences) and 0.01% SDS. After being washed, the proteins were exposed 
on an Odyssey Infrared Imaging System (LI-COR Biosciences) and individual 
protein band optical densities were determined with ImageJ software (National 
Institutes of Health). The blots were normalized against glyceraldehyde-3- 
phosphate dehydrogenase (GAPDH) protein (G8795; Sigma-Aldrich, Sydney, 
Australia). 

Evans blue dye uptake. To quantify muscle damage and areas of focal necrosis, 
EBD (1% w/v) was injected intraperitoneally (10 µl per gram body mass). Muscles 
were frozen 20 h after the EBD injection. Sections 10 µm thick were cut on a 
cryostat, and EBD was detected as red autofluorescence with the use of a 
fluorescence microscope. 

Wire hang test. A wire hang test was employed to assess whole-body muscle 
strength and endurance. Mice were placed on a wire mesh grid, which they 
gripped; the grid was then inverted. Latency-to-fall onto a padded surface was 
recorded in three successive trials, with the average of the best two trials used for 
analyses. 
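
The scoring rule for the wire hang test can be written out explicitly. This is a minimal sketch that assumes "best" means the longest latency-to-fall; the function name is hypothetical.

```python
def wire_hang_score(latencies_s):
    """Average of the two longest latencies (s) from three successive trials."""
    if len(latencies_s) != 3:
        raise ValueError("expected exactly three trials")
    best_two = sorted(latencies_s, reverse=True)[:2]
    return sum(best_two) / 2


# Hypothetical latencies for one mouse: the 12-s trial is discarded.
score = wire_hang_score([12.0, 30.0, 18.0])
```

Discarding the worst trial reduces the influence of a single early fall that reflects inattention rather than strength.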

Single muscle fibre analysis. Mice were anaesthetized with sodium pentobarbitone 
(Nembutal) such that they were unresponsive to tactile stimuli, and the extensor 
digitorum longus and/or soleus muscle was surgically excised for single fibre 
analysis. The muscle was blotted on filter paper and placed in a Petri dish containing 
paraffin oil at room temperature (23 ± 2 °C). Muscles were pinned at resting length 
to the base of a dish. Single muscle fibres were isolated from as close to the surface of 
the muscle as possible, and fine forceps were used to peel the sarcolemma away from 
the contractile apparatus under a dissecting microscope. The mechanically skinned 
fibre was then attached to one end of a piezoresistive force transducer (AE801 
SensoNor; Horten) using braided silk (size 10, 0.2 mm; Deknatel), and the other 
end of the fibre was clamped between a pair of forceps fixed to a micromanipulator. 
All experiments were conducted at room temperature. All solutions had a pH of 
7.10 ± 0.01, and the free Mg²⁺ concentration ([Mg²⁺]) was 1 mM, unless specified 
otherwise. Free [Ca²⁺] at 0.1 µM or more was verified with a Ca²⁺-sensitive 
electrode (Orion Research). 

Caffeine-induced force responses and SR Ca²⁺ accumulation. Initially, 
mechanically skinned muscle fibres were equilibrated for 30 s in a wash solution 
followed by thorough depletion of SR Ca²⁺ stores, achieved by transferring the fibre 
preparation into a release solution containing 30 mM caffeine and 0.02 mM free 
Mg²⁺. The presence of 0.5 mM EGTA in the release solution ensured that the level 
of Ca²⁺ during caffeine-induced release did not maximally activate the contractile 
apparatus, which is necessary to allow quantitative evaluation of the amount of 
Ca²⁺ released. Ca²⁺ release from the SR was estimated from the relative areas 
under the caffeine-induced force response. The fibre was left in the release solution 
for 2 min to ensure complete SR Ca²⁺ depletion, before being washed for 30 s. 
Thereafter, the SR was reloaded with Ca²⁺ in load solution (0.2 µM Ca²⁺ (pCa 
6.7), where pCa = −log₁₀[Ca²⁺]) for increasing durations (10, 20, 30 and 60 s), 
before being equilibrated for 30 s in wash solution; subsequently, SR Ca²⁺ was 
released in release solution. Data were fitted with a standard exponential 
association equation to give the rate at which the SR accumulated Ca²⁺ (in s⁻¹) but not 
the amount of SR Ca²⁺ accumulated. SR accumulation is an indirect measure of 
SERCA activity. 
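
The pCa definition above, and the general shape of an exponential association curve, can be illustrated as follows. The one-phase form with a plateau and a rate constant is an assumption about what "standard" means here, and the parameter values are hypothetical.

```python
import math


def pca_from_molar(ca_molar):
    """pCa = -log10 of the free Ca2+ concentration in mol per litre."""
    return -math.log10(ca_molar)


def molar_from_pca(pca):
    """Free Ca2+ concentration (mol per litre) for a given pCa."""
    return 10 ** (-pca)


def exponential_association(load_time_s, plateau, rate_per_s):
    """One-phase association: signal rises from 0 towards a plateau."""
    return plateau * (1.0 - math.exp(-rate_per_s * load_time_s))


# 0.2 µM free Ca2+ corresponds to a pCa of about 6.7, as stated in the text.
load_solution_pca = round(pca_from_molar(0.2e-6), 1)
```

In this form the rate constant sets how quickly the plateau is approached, independently of the plateau height, which matches the statement that the fit gives the rate of accumulation but not the amount.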

SR Ca²⁺ leak. The percentage of Ca²⁺ lost from the SR as a result of the passive 
leak was also assessed. The fibre was loaded for 20 s in loading solution. The fibre 
preparation was then placed in wash solution for 30 s, after which the SR Ca²⁺ content 
was released in release solution (Ca²⁺ leak in 30 s). The fibre preparation was placed in 
wash solution before reloading for 20 s in load solution and transferred to wash 
solution for 90 s; the remaining SR Ca²⁺ was released in release solution (Ca²⁺ leak 
in 90 s). The 30-s Ca²⁺ leak was then repeated, and the area (corrected for 
proportionality between area and SR Ca²⁺ content) under the test run was divided by 
the average of the areas under the caffeine-induced force responses in the controls 
before and after the test run. This gave an estimate of the fraction of SR Ca²⁺ 
leaked over a 60-s leak period. 

Relative SR Ca²⁺ sensitivity. To determine the effect of Hsp72 overexpression on 
the ryanodine receptor (RyR), a caffeine dose-response curve was determined 
from the forces produced by the contractile apparatus after SR Ca²⁺ release 
induced by low caffeine concentrations. Each fibre was prepared by completely 
depleting the SR of Ca²⁺ with 30 mM caffeine followed by a 30-s SR Ca²⁺ 
reloading duration. The peak force of caffeine-induced contraction was determined in a 
series of potassium hexamethylene-diamine tetraacetic acid solutions containing 
2, 3, 5 and 7 mM caffeine and 50 µM EGTA, with complete SR Ca²⁺ depletion and 
30-s reloading duration of the SR with Ca²⁺ between each caffeine contraction. 
The peaks of the caffeine-induced contractions were then normalized as a 
percentage of maximum Ca²⁺-activated force to estimate the sensitivity of the 
RyR to caffeine. This determined whether overexpression of Hsp72 directly 
affected the function of the RyR. 

Properties of the contractile apparatus. After SR properties were investigated, 
the single muscle fibres were equilibrated in a relaxing solution (pCa > 9) for 
2 min. Fibres were placed in a maximum Ca²⁺-activating solution (pCa ~ 4.5) 
until force reached the maximal value (maximum Ca²⁺-activated force) and then 
placed back in the relaxing solution for a further 2 min. Force responses were 
generated by exposing the fibre to activating solutions of progressively lower 
pCa (higher [Ca²⁺]) in a stepwise fashion. The force response generated at each 
pCa was expressed as a percentage of the interpolated values for maximum 
Ca²⁺-activated force. Data points were fitted with a Hill equation producing two 
parameters: the pCa₅₀ (the pCa producing half-maximum force) and nH (the Hill 
coefficient, indicative of the steepness of the force-pCa relationship). 
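
A common way of writing the Hill relationship between pCa and relative force is sketched below. The exact parameterization used for the fits is not given in the text, so this form and the example values are assumptions.

```python
def relative_force_percent(pca, pca50, hill_coefficient):
    """Force as a percentage of maximum Ca2+-activated force.

    Lower pCa means higher [Ca2+], so force rises as pCa falls below pCa50.
    """
    return 100.0 / (1.0 + 10 ** (hill_coefficient * (pca - pca50)))


# Hypothetical fibre with pCa50 = 5.8 and nH = 3.
at_midpoint = relative_force_percent(5.8, 5.8, 3.0)
near_maximum = relative_force_percent(4.5, 5.8, 3.0)
relaxed = relative_force_percent(9.0, 5.8, 3.0)
```

A shift of pCa₅₀ to higher values would indicate greater Ca²⁺ sensitivity of the contractile apparatus, whereas nH changes only the steepness around the midpoint.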

Western blotting. After the determination of SERCA activity in whole-muscle 
homogenates and enriched SR vesicles, electrophoresis was performed on the 
same samples for the quantification of protein expression. Equal amounts of 
protein were resolved in SDS buffer, heated for 5 min at 95°C and separated on 
SDS-polyacrylamide gels. Separated proteins were transferred to poly(vinylidene 
difluoride) membranes (0.45-µm Immobilon-P; Millipore). Membranes were 
blocked with 5% non-fat milk powder (or BSA) in Tris-buffered saline containing 
Tween 20 for 1 h and incubated overnight at 4 °C with appropriate antibody 
dilutions. Antibodies against SERCA1 (Affinity Bioreagents), SERCA 2a 
(Affinity Bioreagents), Hsp72 (Stressgen), α-tubulin (ECM-Biosciences) and 
GAPDH (Sigma) were used. Membranes were digitized with a chemiluminescent 
detection and imaging system (ChemiDoc XRS; Bio-Rad Laboratories, Hercules, 
California, USA) and band densities were quantified with Quantity One analysis 
software (version 4.6.1; Bio-Rad Laboratories). 

Real-time PCR. Real-time PCR was performed as described previously. Each 
sample was run in triplicate. The mean reading of each triplicate was converted to 
an absolute content by using a DNA standard curve, based on a serial dilution of 
DNA standard (100-10,000 ng ml⁻¹; Molecular Probes) run together with the 
samples on each plate. Gene expression was quantified by normalizing the 
logarithmic cycle threshold (Ct) value (2^−Ct) to the cDNA content of each sample 
to obtain the expression 2^−Ct/cDNA content (ng ml⁻¹). In Supplementary Fig. 5b, 
mRNA for SERCA 1 and SERCA 2a was normalized to the eukaryotic 18S 
housekeeping gene. 

Creatine kinase analysis. Serum CK activity was analysed as an overall measure of 
whole-body muscle breakdown. Blood was collected from the tail vein and cen- 
trifuged for 10 min at 10,000g and 4 °C to isolate the serum fraction. Serum CK 
activity was then determined with a commercially available creatine kinase assay 
kit (ECPK-100) in accordance with the manufacturer’s instructions (BioAssay 
Systems). 

Skeletal preparation. Skeletal architecture was revealed as described previously. 
After death, the mice were skinned, eviscerated and placed in a KOH solution 
(1.5% w/v) for 5 days. KOH solution was replaced and a small amount of alizarin 
red was added to stain calcium deposits; the preparation was left for a further 
5 days to produce the specimens shown in Fig. 4a. 

Statistical analysis. All values are presented as means ± s.e.m. Unpaired Student’s 
t-tests were used to compare between two groups. When comparisons were being 
made between more than two groups, a one-way analysis of variance was used with 
Newman-Keuls post-hoc multiple comparison test. 
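
For two groups, the summary statistic and the test statistic named above can be sketched as follows. This is an illustration using hypothetical data and the equal-variance form of Student's t-test; converting t to a P value requires the t distribution, and the analysis of variance with post-hoc testing is not shown.

```python
import math
import statistics


def mean_and_sem(values):
    """Mean and standard error of the mean (sample SD / sqrt(n))."""
    mean_value = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean_value, sem


def unpaired_student_t(group_a, group_b):
    """Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_variance = (
        (n_a - 1) * statistics.variance(group_a)
        + (n_b - 1) * statistics.variance(group_b)
    ) / (n_a + n_b - 2)
    standard_error = math.sqrt(pooled_variance * (1 / n_a + 1 / n_b))
    t_value = (statistics.mean(group_a) - statistics.mean(group_b)) / standard_error
    return t_value, n_a + n_b - 2


# Hypothetical measurements for two groups of three animals.
t_value, degrees_of_freedom = unpaired_student_t([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
```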

H⁺-translocating pyrophosphatases (H⁺-PPases) are active 
proton transporters that establish a proton gradient across the 
endomembrane by means of pyrophosphate (PPᵢ) hydrolysis. 
H⁺-PPases are found primarily as homodimers in the vacuolar 
membrane of plants and the plasma membrane of several protozoa 
and prokaryotes. The three-dimensional structure and detailed 
mechanisms underlying the enzymatic and proton translocation 
reactions of H⁺-PPases are unclear. Here we report the crystal 
structure of a Vigna radiata H⁺-PPase (VrH⁺-PPase) in complex 
with a non-hydrolysable substrate analogue, imidodiphosphate 
(IDP), at 2.35 Å resolution. Each VrH⁺-PPase subunit consists 
of an integral membrane domain formed by 16 transmembrane 
helices. IDP is bound in the cytosolic region of each subunit and 
trapped by numerous charged residues and five Mg²⁺ ions. A 
previously undescribed proton translocation pathway is formed 
by six core transmembrane helices. Proton pumping can be 
initialized by PPᵢ hydrolysis, and H⁺ is then transported into the 
vacuolar lumen through a pathway consisting of Arg 242, Asp 294, 
Lys 742 and Glu 301. We propose a working model of the 
mechanism for the coupling between proton pumping and PPᵢ 
hydrolysis by H⁺-PPases. 

Two proton-pumping proteins, vacuolar H⁺-ATPases (V-ATPases) 
and H⁺-PPases, coexist on plant vacuolar membranes and 
use ATP and PPᵢ, respectively, as energy sources for H⁺ translocation. 
Both of these enzymes acidify the vacuolar lumen and establish an 
electrochemical proton gradient across the vacuolar membrane. 
Whereas V-ATPases have been widely studied, the structure and 
function of H⁺-PPases are as yet unknown. The H⁺-PPase was first 
identified from the photosynthetic bacterium Rhodospirillum rubrum, after 
which the functional and biochemical investigations of vacuolar 
H⁺-PPases from plants were reported. Subsequently, the protein chemical 
identification of H⁺-PPases from red beet and mung bean and the 
molecular cloning of the enzyme from Arabidopsis were gradually 
completed. Afterwards, the heterologous expression of H⁺-PPase in 
yeast was established and the sufficiency of PPᵢ-energized H⁺ 
translocation was demonstrated. On the basis of these arduous results, a fourth 
category of primary H⁺ pump, H⁺-PPases, was thus determined. 

H⁺-PPases have a high degree of amino acid sequence homology 
(86-91% identity in land plants; Supplementary Fig. 1), and they can 
be divided into two subfamilies: type I (K⁺-dependent) and type II 
(K⁺-independent). In Arabidopsis, the H⁺-PPase regulates organ 
development by modulating the apoplastic pH and eliminating the 
metabolic by-product, cytosolic PPᵢ (refs 14, 15). The overexpression 
of H⁺-PPases improves tolerance to salinity and drought in many 
higher plants by increasing ion retention. A PPᵢ-binding motif, 
(E/D)(X)₇KXE, and two acidic motifs, DX₃DX₃D, have been suggested 
as being essential for the enzymatic function of H⁺-PPases. 
Nevertheless, the three-dimensional structure of H⁺-PPase is 
unknown. We therefore sought to solve its atomic structure to gain 
a better understanding of the machinery of this unique proton pump. 
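
The two sequence motifs named above translate directly into regular expressions, with X read as any residue. The following sketch locates them in a protein sequence; the toy sequence is hypothetical and is not the VrH⁺-PPase sequence, and the function and constant names are illustrative.

```python
import re

# Motifs as written in the text; "." stands for X (any residue).
PPI_BINDING_MOTIF = r"[ED].{7}K.E"  # (E/D)(X)7KXE
ACIDIC_MOTIF = r"D.{3}D.{3}D"       # DX3DX3D


def find_motif(sequence, motif):
    """Return (1-based start, matched text) for every occurrence.

    A lookahead is used so that overlapping occurrences are also reported.
    """
    lookahead = re.compile("(?=(" + motif + "))")
    return [(m.start() + 1, m.group(1)) for m in lookahead.finditer(sequence)]


# Hypothetical toy sequence containing one copy of each motif.
toy_sequence = "MMDAAADAAAKAEGGDVVVDLLLDMM"
ppi_hits = find_motif(toy_sequence, PPI_BINDING_MOTIF)
acidic_hits = find_motif(toy_sequence, ACIDIC_MOTIF)
```

Positions are reported with 1-based numbering to match the residue numbering convention used for proteins.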


We determined the crystal structure of VrH⁺-PPase bound to IDP 
by the multiple-wavelength anomalous dispersion (MAD) method 
and multiple isomorphous replacement with anomalous scattering 
(MIRAS) (Supplementary Table 1). The protein is compactly folded 
in a rosette manner with 16 transmembrane helices (TMs M1-M16) 
within two concentric walls (Fig. 1a). Six TMs (M5, M6, M11, M12, 
M15 and M16) bundle together to form an inner wall, which is in turn 
surrounded by ten additional TMs (M1-M4, M7-M10 and M13- 
M14) that constitute the outer wall (Fig. 1b). The TMs fold sequentially 
counterclockwise in the inner and outer walls with the exception of 
M1, which tilts slightly outward. There are two short helices (α2 and 
α3) on the cytosolic side, and two additional helices (α1 and α4) 
together with two antiparallel β strands (β1 and β2) on the luminal 
side (Supplementary Fig. 2). A disulphide bond (C124-C132) was 
detected in loop 2 (Supplementary Fig. 2). Both the amino and carboxy 
termini face the vacuolar lumen (Fig. 1c). Each subunit contains a 
single IDP molecule embedded in the core of the inner wall near the 
cytosolic region. It is conceivable that the inner wall is responsible for 
substrate binding, and the outer wall probably maintains the structural 
integrity of the protein. 

VrH⁺-PPase is a homodimer with an extensively buried 3,241 Å² 
surface between the subunits (Fig. 1c). The electrostatic surface potential 
of the VrH⁺-PPase dimer is shown in Fig. 1d. The cytosolic region of 
VrH⁺-PPase has many hydrophilic residues, whereas the vacuolar 
region of the protein protruding out of the membrane is smaller. 
The homodimer has two-fold symmetry and a root mean squared 
deviation of 0.32 Å among the Cα atoms between the two subunits 
(Supplementary Fig. 3a). Four TMs (M5, M12, M15 and M16) from the 
inner wall and two TMs (M10 and M13) from the outer wall participate 
in subunit interactions that primarily include hydrophobic inter- 
actions, six hydrogen bonds and two salt bridges (Supplementary 
Fig. 3b-d and Supplementary Table 2). 

The substrate/IDP-binding site is a funnel-shaped pocket formed by 
six core TMs with a solvent-accessible volume of 1,521 Å³ (Fig. 2a). 
The electrostatic surface potential of the IDP-binding pocket is shown 
in Fig. 2b. The pocket has an unusually acidic environment that contains 
12 acidic residues (Asp 253, Asp 257, Glu 268, Asp 269, Asp 279, 
Asp 283, Asp 287, Asp 507, Asp 691, Asp 723, Asp 727 and Asp 731). 
Three lysine residues (Lys 250, Lys 730 and Lys 694) and one asparagine 
residue (Asn 534) are also found in this pocket (Fig. 2c). All these 
residues are highly conserved among H⁺-PPases (Supplementary 
Fig. 1). Site-directed mutagenesis studies indicate that most of these 
conserved residues are essential for PPᵢ hydrolysis. In the current 
structure, the IDP molecule was bound directly by three lysine 
residues through hydrogen bonds. In addition, one K⁺ ion and five 
Mg²⁺ ions (Mg1-Mg5) were identified around the IDP molecule at 
the pocket (Fig. 2c). K⁺ is the stimulator of the type I H⁺-PPases, and 
Mg²⁺ is essential for the activity of H⁺-PPases. All these Mg²⁺ ions 
mediate the interactions between IDP and Asp/Asn residues. The 
binding interactions (Supplementary Table 3) precisely confine the 
substrate at the active site in a proper orientation for enzymatic 
hydrolysis. 

Figure 1 | The crystal structure of the VrH⁺-PPase-IDP complex. a, Ribbon 
diagram of the overall structure of VrH⁺-PPase, containing 16 TMs (labelled 
1-16). A missing region (residues 42-66) is shown with dotted lines. b, The six 
inner and ten outer TMs drawn as cylinders and coloured in yellow and green, 
respectively. This orientation is rotated by 60° from that in a. c, VrH⁺-PPase 
dimer shown as a ribbon diagram with height and width dimensions of 75 Å 
and 85 Å, respectively. The detergent molecules of n-decyl β-D-maltoside are 
shown as sticks. d, Electrostatic surface potential of the VrH⁺-PPase dimer 
(red, blue and white indicate negative, positive and neutral potentials, 
respectively). In a-c, IDP is shown as sticks and coloured in CPK. 

Figure 2 | The substrate-binding site. a, Six core 
TMs (yellow ribbon) with IDP-binding residues 
(sticks). b, The electrostatic surface potential of the 
IDP-binding pocket (red, blue and white indicate 
negative, positive and neutral potentials, 
respectively). c, The IDP-binding residues. Mg²⁺ 
ions, K⁺ ions and water molecules are shown as 
green, purple and red spheres, respectively. 
Interactions are presented as dashed lines. d, The 
binding site of VrH⁺-PPase (in white) 
superimposed on EcPPase (PDB 2AUU; in pink). 
The Mg²⁺ ions of the VrH⁺-PPase (in green, with 
numbers) and EcPPase (in grey) are shown as 
spheres. The F⁻ in EcPPase is shown as a blue 
sphere, and the Watₙᵤ in VrH⁺-PPase is shown as a 
labelled red sphere. IDP in VrH⁺-PPase and PPᵢ in 
EcPPase are coloured in CPK. 

The current structure indicates that the highly conserved fragment 
D253-X3-D257-X3-K261-X-E263 from M5, corresponding to the 
DX₇KXE motif, is involved in substrate binding. Both Asp 253 
and Asp 257 interact with IDP through Mg1 and Mg3 (Fig. 2c). 
However, Lys 261 and Glu 263 do not participate directly in IDP binding. 
Lys 261 contributes to a salt-bridge network (Supplementary Fig. 4 
and Supplementary Table 4) that connects the core TMs to stabilize the 
active site. Glu 263 forms a salt bridge with Arg 609 and probably 
contributes to the structural stability of VrH⁺-PPase. Furthermore, 
in the highly conserved acidic motifs D279-X3-D283-X3-D287 and 
D723-X3-D727-X3-D731, all aspartic acids except Asp 287 
and Asp 731 interact with IDP through either Mg²⁺ ions or water. 

In Fig. 2d the substrate and the surrounding residues in VrH⁺-PPase 
are superimposed on those of type I soluble Escherichia coli 
pyrophosphatase (EcPPase). Both enzymes contain several aspartic acid 
residues for coordinating Mg²⁺ ions to bind the substrate at the active 
site. There are five and four Mg²⁺ ions in the binding pockets of 
VrH⁺-PPase and EcPPase, respectively. The binding of Mg5 to IDP 
is detected only in VrH⁺-PPase. The locations of Mg3 and Mg4 in 
VrH⁺-PPase are similar to those of the corresponding Mg²⁺ ions in 
EcPPase. In contrast, Mg1 and Mg2 are located at distinct positions in 
the two enzymes. The distances between Mg1 and Mg2 in VrH⁺-PPase 
and EcPPase are 5.3 and 3.5 Å, respectively. In EcPPase, Mg1 and Mg2 
bind with a specific inhibitor, F⁻, which occupies the position of the 
nucleophile close to the P2 phosphate of PPᵢ, to prevent hydrolysis of 
the substrate. However, a similar binding phenomenon is not 
observed in VrH⁺-PPase, because the distance between Mg1 and 
Mg2 is too large to bind a nucleophile for attacking PPᵢ (Fig. 2d). 
Instead, in VrH⁺-PPase, a water molecule (Watₙᵤ) is found near the 
P1 phosphate of IDP (2.6 Å) and forms hydrogen bonds with Asp 287 
and Asp 731. It is conceivable that Watₙᵤ might act as a 
nucleophile for PPᵢ hydrolysis in VrH⁺-PPase. Asp 287 and Asp 731 
are highly conserved among H⁺-PPases but are replaced by Arg 43 and 
Tyr 141 in soluble EcPPase. F⁻ cannot compete with Watₙᵤ for the 
same binding site, presumably as a result of the repulsion between 
negatively charged F⁻ and aspartic acid. H⁺-PPase is therefore less 
sensitive to F⁻ inhibition than soluble PPase is. These differences 
suggest that membrane-bound H⁺-PPases and soluble PPases use 
different strategies to trap the nucleophile for PPᵢ hydrolysis. 

Previous evidence suggests that the dimerization of H+-PPase is
important for H+ translocation activity. However, residues that
can be protonated as part of proton conduction could
not be found in the dimer interface in VrH+-PPase. Instead, four
lined-up charged residues, Arg 242, Asp 294, Lys 742 and Glu 301,
were observed in a narrow, compact and water-inaccessible transmembrane
region of the core TMs (Fig. 3a). These residues are highly
conserved among H+-PPases except Glu 301, which is found only in
higher plants (Supplementary Fig. 1). Glu 301 is located at a rather
narrow point in the pathway and close to the vacuolar lumen. It might
therefore act as a constricting neck (Fig. 3a). Mutations at Glu 301
abolish proton translocation activity (Supplementary Table 5).
In addition, Glu 301 contributes to the binding of the lipophilic
carbodiimide N,N′-dicyclohexylcarbodiimide (DCCD), a potent blocker
of proton conductance. These data indicate that Glu 301 probably
serves as the proton donor/acceptor in the proton translocation mechanism.
Furthermore, Asp 294 and Lys 742 form the only salt bridge
among these acid-base pairs (Fig. 3a). Lys 742 might reciprocally
modulate the protonation and deprotonation of Asp 294 and Glu 301,
similar to that in the regulatory machinery of bacteriorhodopsin.
Arg 242, which is located near the PPi-binding site, may form a salt
bridge with Asp 287 or Asp 731, resulting in the deprotonation of the
latter residues. Site-directed mutagenesis studies revealed that Arg 242,
Asp 294 and Lys 742 are crucial for PPi hydrolysis as well as H+ translocation
(Supplementary Table 5). Together, these acid-base pairs,

Figure 3 | The proton transport pathway of VrH+-PPase. a, The proton
transport pathway is formed by six core TMs (M5, M6, M11, M12, M15 and
M16). The solvent-accessible surface area is coloured in cyan. The arrow
indicates the direction of proton translocation, from the cytosol to the
vacuolar lumen (high [H+]). Right: zoomed-in view of the
proton transport pathway. The residues involved in proton transport are shown
and labelled. The solvent-accessible surface has been removed. Bottom: the
hydrophobic gate around Glu 301. Residues forming a hydrophobic gate are
displayed and labelled. b, The electron density map (2Fobs − Fcalc) (in blue)
around the proton transport pathway drawn at a contour level of 2σ. The IDP is
shown as sticks and coloured in CPK. Water molecules Watnu, Wat1 and Wat2
are presented as labelled red spheres. Water-mediated hydrogen bonds are
drawn as black dashed lines.


Arg 242, Asp 294, Lys 742 and Glu 301, form a potential proton trans- 
port pathway surrounded by six core TMs that convey protons from 
the active site to the vacuolar lumen. 

The proton-pumping proteins usually use bound water molecules to
assist proton transport; examples are bacteriorhodopsin and P-type
H+-ATPases. Consistent with this, two ordered water molecules (Wat1 and
Wat2) along the proton transport pathway of VrH+-PPase were identified
from the electron density map (Fig. 3b). Wat1 forms hydrogen
bonds with Arg 242 and Asp 294 and seems to be a continuation of the
bulk solvent at the PPi-binding site. Wat2 was found in the vicinity of
Lys 742 (at a distance of 3.7 Å) and trapped by Asp 294, Ser 298,
Ser 547, Asn 738 and Lys 742, with hydrogen bonding to Ser 298,
Ser 547 and Asn 738. The bound waters, Wat1 and Wat2, presumably
facilitate proton transport in the pathway of VrH+-PPase.

In the VrH+-PPase-IDP complex, the vacuolar portion of the proton
transport pathway is relatively narrow and is occupied by hydrophobic
residues, such as Leu 232 (M5), Ala 305 (M6), Leu 555 (M12) and

Figure 4 | A working model for proton pumping in VrH+-PPase. a, Resting
state (R state). b, Initiated state (I state). c, Transient state (T state). The VrH+-PPase
dimer is shown and coloured in green and blue. M6 and M16 are shown


Val 746 (M16) (Fig. 3a). Most of these residues are conserved among
H+-PPases from various species (Supplementary Fig. 1). They possibly
form a hydrophobic gate, keeping Glu 301 away from the hydrophilic
environment of the vacuolar lumen, where the H+ concentration is
higher. Such a gate prevents the H+ from refluxing and maintains
unidirectional H+ translocation from cytosol to lumen. It is conceivable
that the constricted pathway and the acid-base pairs have crucial
functions in directional proton pumping of the H+-PPase.

On the basis of previous mutational and biochemical studies
(Supplementary Table 5) and the present structural findings, we propose
a working model for the coupling of PPi hydrolysis to proton
pumping (Fig. 4). In this model, the process involves three sequential
states: R, I and T. The limited trypsinolysis analysis of H+-PPase
(Supplementary Fig. 5) indicates that the binding pocket of
H+-PPase is flexible and might be exposed to the solvent in the absence
of the substrate, when the protein is in a resting state (R state; Fig. 4a). In
addition, an open conformation at the active site in the absence of
substrate (PPi) was also supported by a study using single-molecule
fluorescence resonance energy transfer (FRET). In contrast, the
luminal portion of the core TMs assumes a closed conformation that
prevents H+ from flowing back into the cytosol. On binding of substrate
(or IDP in our structure), H+-PPase enters an initiated state for
PPi hydrolysis (I state; Fig. 4b). The core TMs then change into a closed
conformation on the cytosolic side, locking PPi in the substrate-binding
pocket. At this stage, the luminal portion of these TMs is
changed into a semi-open conformation for sequential H+ translocation.
The structure of the VrH+-PPase-IDP complex that we have
resolved would represent the protein in the I state. On hydrolysis of
PPi, free phosphates are generated with concomitant proton production
(Fig. 4c). Then the proton relay through Arg 242-Asp 294-
Lys 742-Glu 301 occurs as a chain reaction, and the core TMs adopt
as cylinders. The residues involved in proton transport are shown and labelled.
PPi and free Pi are shown as sticks and coloured in CPK. The proton and
hydroxyl ions are labelled. The possible salt bridges are shown as dashed lines.


an open conformation on the vacuolar lumen side for proton release.
Consequently, H+-PPase proceeds to the transient state (T state;
Fig. 4c). After the release of Pi to the cytosol and translocation of a
proton to the vacuolar lumen, H+-PPase returns to the R state. This
proton-pumping mechanism is completed through a series of delicate
conformational changes driven primarily by the energy derived from
PPi hydrolysis. Thus, our working model provides a scheme capable of
accounting for how PPi hydrolysis and the movement of protons from
one side of the membrane to the other might be accomplished by the
H+-PPase.


METHODS SUMMARY 


Vigna radiata H+-translocating pyrophosphatase (VrH+-PPase) fused with a
C-terminal His-tag was expressed and isolated from Saccharomyces cerevisiae with
methods that have been described previously. In brief, VrH+-PPase was crystallized
by using 23% (w/v) PEG2000 precipitant in 50 mM MES pH 6.5. The structural
phase was determined from the tantalum derivative Ta6Br14 by MAD and also from
OsO4 and orange-Pt derivatives by MIRAS (Supplementary Table 1). Model building
was performed with COOT, and the final model was refined with REFMAC5 to
an R-factor of 16.80% and an Rfree of 20.31% at 2.35 Å. The refinement statistics are
summarized in Supplementary Table 1.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Cloning, expression and purification. Vigna radiata H+-translocating
pyrophosphatase (VrH+-PPase) fused with a C-terminal His-tag was expressed
and isolated from Saccharomyces cerevisiae strain BJ2168 in accordance with a
previously described method, with minor modifications. Mutant derivatives
were generated from wild-type VrH+-PPase by a QuikChange site-directed
mutagenesis method, and the sequences were verified by DNA sequencing.
The VrH+-PPase was expressed in the yeast cells, which were transformed with
the construct pYVH6 containing VrH+-PPase cDNA. The yeast cells were
harvested by centrifugation at 4,000g for 10 min after induction for 3 days. The
cell lysates were centrifuged at 4,000g for 10 min, and the supernatant was
ultracentrifuged at 84,000g for 1 h for the collection of microsomal membranes. The
membranes were suspended in a solution containing 25 mM MOPS-KOH pH 7.0,
20% (w/v) glycerol, 4 mM MgCl2, 400 mM KCl and 1 mM phenylmethylsulphonyl
fluoride (PMSF). The membrane proteins were solubilized from microsomal
membranes with the use of n-dodecyl β-D-maltoside at a detergent-to-protein
ratio of 3:1 (w/w) for 1 h at 4 °C. Insolubilized debris was removed by
ultracentrifugation at 84,000g for 1 h. The solubilized VrH+-PPase was purified with
Ni2+-nitrilotriacetate resin (Qiagen, Valencia) and eluted with a solution containing
25 mM MOPS-KOH pH 7.0, 20% (w/v) glycerol, 4 mM MgCl2, 400 mM KCl,
0.15% (w/v) n-decyl β-D-maltoside (DM) and 250 mM imidazole. The purified
VrH+-PPase was dialysed against 25 mM MOPS-KOH pH 7.0, 20% (w/v)
glycerol, 4 mM MgCl2, 0.15% (w/v) DM, 5 mM IDP, and then concentrated to
10 mg ml−1 for the crystallization setup. Enzymatic activities of microsomal and
purified VrH+-PPases were assayed in accordance with previous methods.
Trypsinolysis analysis. Trypsinolysis analysis was performed as described in a
previous study. The microsomal (30 µg) and purified (5 µg) proteins were incubated
with L-1-tosylamido-2-phenylethyl chloromethyl ketone-treated trypsin at a
ratio of 30:1 (w/w) at 37 °C for 10 min. The proteolysis was stopped by the addition
of 2% (w/w) SDS and 5 mM PMSF. The samples were subjected to western blotting
analysis.

Crystallization and X-ray data collection. The hanging-drop vapour-diffusion
method was used to set up crystallization trials. VrH+-PPase (0.5 µl) and
reservoir (0.5 µl) solutions were mixed and equilibrated against a reservoir solution
(500 µl) in Linbro plates. VrH+-PPase crystals appeared in 2 days in the
reservoir containing 23% (w/v) PEG2000, 50 mM MES pH 6.5, 10% (w/v) glycerol
and 250 mM MgCl2 at 20 °C. The native crystal data were collected on the BL44XU
beamline at the SPring-8 synchrotron radiation facility, Japan. The data were
processed using HKL2000 (ref. 34). VrH+-PPase crystals belong to the monoclinic
space group C2 with the unit-cell parameters a = 220.8 Å, b = 88.8 Å, c = 160.2 Å
and β = 125.9°. The Matthews coefficient was calculated to be 3.97 Å3 Da−1,
corresponding to a solvent content of 69% with two subunits per asymmetric unit.
For phase determination, three VrH+-PPase heavy-atom derivatives were obtained
from Ta6Br14 (2 mM), OsO4 (1 mM) and orange-Pt (1 mM). The anomalous data
were collected on the BL13B1 and BL13C1 beamlines at the National Synchrotron
Radiation Research Center, Taiwan. The data statistics are summarized in
Supplementary Table 1.
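
The Matthews coefficient and solvent content quoted above can be checked from the unit-cell parameters. The following is a minimal sketch, not the authors' calculation: it takes the cell as a = 220.8 Å, b = 88.8 Å, c = 160.2 Å, β = 125.9°, uses the four general positions of space group C2 with two subunits per asymmetric unit, and assumes a subunit mass of about 80 kDa (a value not stated in the text). The solvent fraction uses the standard approximation 1 − 1.23/VM.

```python
import math


def monoclinic_cell_volume(a, b, c, beta_deg):
    """Volume in cubic angstroms of a monoclinic cell: V = a * b * c * sin(beta)."""
    return a * b * c * math.sin(math.radians(beta_deg))


def matthews_coefficient(cell_volume, copies_per_cell, mass_da):
    """Crystal volume per unit of protein mass (cubic angstroms per dalton)."""
    return cell_volume / (copies_per_cell * mass_da)


def solvent_fraction(vm):
    """Standard approximation for the solvent fraction of a protein crystal."""
    return 1.0 - 1.23 / vm


# Unit-cell parameters reported in the text (b read as 88.8 angstroms).
A, B, C, BETA_DEG = 220.8, 88.8, 160.2, 125.9

GENERAL_POSITIONS_C2 = 4     # symmetry-equivalent positions in space group C2
SUBUNITS_PER_ASU = 2         # stated in the text
ASSUMED_MASS_DA = 80_000.0   # assumption: approximate subunit mass, not from the text

volume = monoclinic_cell_volume(A, B, C, BETA_DEG)
vm = matthews_coefficient(
    volume, GENERAL_POSITIONS_C2 * SUBUNITS_PER_ASU, ASSUMED_MASS_DA
)
solvent = solvent_fraction(vm)

print(f"cell volume: {volume:.0f} cubic angstroms")
print(f"Matthews coefficient: {vm:.2f} cubic angstroms per dalton")
print(f"solvent content: {100 * solvent:.0f}%")
```

With the assumed mass, the result lands within rounding of the reported 3.97 Å3 Da−1 and 69%; a slightly different subunit mass shifts the second decimal place.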

Structural determination and refinement. The structural phase of VrH+-PPase
was obtained with a tantalum derivative by MAD and from OsO4 and orange-Pt
derivatives by MIRAS (Supplementary Table 1). The heavy-atom sites were
located with SOLVE. The final phase combination was calculated by AutoSol
using PHENIX, resulting in distinguishable protein and solvent regions. The
preliminary structural model was automatically built by AutoBuild, and the
entire model was completed manually with COOT. The refinement of the structure
was performed with REFMAC5 (ref. 40). PROCHECK was used to evaluate
the stereochemistry and to assign the secondary structural elements. The structural
model was refined to an R-factor of 16.80% and an Rfree of 20.31% at 2.35 Å.
The refinement statistics are summarized in Supplementary Table 1. All the figures
were generated using PYMOL. The solvent-accessible surface area was calculated
with the program HOLLOW using a 1.4-Å probe radius.
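
For readers unfamiliar with the refinement statistics quoted above, the conventional R-factor is the summed absolute difference between observed and calculated structure-factor amplitudes divided by the summed observed amplitudes; Rfree applies the same formula to a small set of reflections held out of refinement. The sketch below illustrates only that definition. The amplitudes are made-up toy values, the function name is hypothetical, and the amplitudes are assumed to be on a common scale already; it is not how REFMAC5 computes its statistics internally.

```python
def r_factor(f_obs, f_calc):
    """Conventional crystallographic R: sum(| |Fo| - |Fc| |) / sum(|Fo|)."""
    if len(f_obs) != len(f_calc):
        raise ValueError("observed and calculated lists must have equal length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


# Toy amplitudes (hypothetical, for illustration only).
work_obs = [100.0, 50.0, 25.0, 25.0]    # reflections used in refinement
work_calc = [90.0, 55.0, 30.0, 20.0]
free_obs = [80.0, 20.0]                 # reflections held out of refinement
free_calc = [68.0, 24.0]

r_work = r_factor(work_obs, work_calc)
r_free = r_factor(free_obs, free_calc)

print(f"R = {100 * r_work:.2f}%, Rfree = {100 * r_free:.2f}%")
```

Because the held-out reflections never influence the model, Rfree is normally somewhat higher than R, as it is for the values reported in the text.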
CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature11029


Corrigendum: Recovery rates 
reflect distance to a tipping point in 
a living system 


Annelies J. Veraart, Elisabeth J. Faassen, Vasilis Dakos, 
Egbert H. van Nes, Miquel Lürling & Marten Scheffer


Nature 481, 357-359 (2012) 


There was a scaling error in the light attenuation values of Fig. 1 and 
Supplementary Fig. 2.1 (about 210 should have been subtracted from 
each value). Also, the sentence “We perturbed the populations every 
4-5 days by removing 10% of their biomass through dilution.” on 
page 357 should have read “We perturbed the populations every 
4-5 days by flushing with medium (10% of the volume), which was 
equivalent to a reduction of the biomass by 3-5% owing to incomplete 
mixing.”. This sentence and Fig. 1 and Supplementary Fig. 2.1 have 
been replaced in the PDF and HTML versions online. These changes 
do not alter any of the conclusions of this Letter. We thank J. Huisman 
for drawing our attention to these issues. 
CAREERS




Images from the surface of Mars: analysing these may form part of an astrobiologist’s role. 


ASTROBIOLOGY 


Alien encounters 


Establishing a career in astrobiology is not always easy. But 
keeping one’s options open can lead to work in this niche field. 


BY NAOMI LUBICK 


Michael Callahan knows that he has a great job every time a meteorite
sample arrives at the laboratory from
NASA’s meteorite archive in Houston, Texas.
“There’s no cooler feeling than working with a
meteorite in the lab,” he says.

Callahan, an analytical chemist at the Astro- 
biology Analytical Laboratory at NASA’s 
Goddard Space Flight Center in Greenbelt, 
Maryland, is one of a growing number of 
researchers around the world working in astro- 
biology — the exploration of how and where life 
might have emerged in the Universe. 

The breadth of astrobiology can be daunt- 
ing. Because the field focuses on questions 
about the origins of life, it incorporates 


aspects of astronomy and physics as well as 
chemistry, geology, oceanography, micro- 
biology and bioinformatics. As a result, 
there are many different routes to a career in 
astrobiology; Callahan, for example, started 
out as an analytical chemist in industry. 
Exobiology — the study of the possibility of 
life on other planets — thrived with origins- 
of-life research in the 1950s and ‘60s, and 
really took off with the space exploration pro- 
grammes in the 1960s and ‘70s. Later, when 
the astrobiology label took hold, some people 
questioned the field’s seriousness, notes Zita 
Martins, an astrobiologist at Imperial College 
London. Many different types of researchers 
now contribute to the field; geomicrobiologists 
look for extremophiles — organisms that live in 
extreme environments such as deep caves or hot 
springs; astronomers search for chemical signs
of metabolism on other planets; and biologists 
investigate how life on Earth first appeared. 

Martins, for example, works in the depart- 
ment of Earth science and engineering, 
developing techniques for uncovering microbial 
life and metabolism in extreme settings, using 
soil samples from the Utah desert as stand-ins 
for samples from Mars. She collaborates with 
soil scientists, geochemists, biologists and 
robotics engineers who specialize in exploration.
“That’s the beauty of the astrobiology
field. It’s scientists from different research
fields, working together,” she says. NASA has an
annual conference dedicated to the field, which 
is taking place this week in Atlanta, Georgia. 


JOINT MISSIONS 

Support for astrobiology research is spread 
across many funding streams and disciplines, 
and finding funding and jobs that are strictly 
astrobiology-related is difficult, especially given 
recent budget woes. NASA and the European 
Space Agency (ESA) support the largest num- 
ber of astrobiology-related positions, whether 
they are in Earth-science observation or engi- 
neering jobs on the next space mission. But 
NASA is facing budget cuts that have alarmed 
researchers in the field. For 2012 and 2013, a 
slight increase in the budget for human space 
exploration, from US$3.7 billion to a proposed 
$3.9 billion, was countered by a decrease for the 
science directorate, including planetary-science 
research, from $5.1 billion to a proposed $4.9 
billion. The agency has withdrawn its com- 
ponent of the ExoMars programme — a joint 
mission with ESA to send an orbiter and rover 
to look for methane on the red planet, which is 
scheduled for 2018. NASA’s funding would have
created more research positions for biologists 
and instrumentalists, among others. 

Although the US Terrestrial Planet Finder 
— a mission intended to look for Earth-sized 
planets in the habitable zones of distant solar 
systems — has been suspended, astronomers 
hope that NASA’s James Webb Space Telescope, 
an observational mission with similar planet- 
finding goals, will continue moving forward. 

Despite cuts in some areas, NASA’s Earth
budget is up slightly from $1.76 billion in 2012 
to $1.79 billion in 2013, notes geochemist 
Mitchell Schulte, a programme officer for the 
Mars Exploration Programme at NASA head- 
quarters in Washington DC. That means that 
research done on this planet but with applica- 
tions off-planet could still be awarded fund- 
ing, such as hunting for extremophiles and 
Noctis Labyrinthus, a region of the red planet captured by NASA’s Mars Odyssey orbiter.


their proteins, or remote-sensing research such 
as LiDAR (a technique for detecting water, or 
mapping minerals and topography). 

The Centre for Astrobiology in Madrid, 
which was established in 2000 as a partner 
to the NASA Astrobiology Institute based in 
Mountain View, California, is an example of 
an organization that is crucial for strengthen- 
ing ties between researchers around the world, 
says astrophysicist Lisa Kaltenegger, who
holds dual positions at the Harvard-Smithsonian
Center for Astrophysics in Cambridge,
Massachusetts, and the Max Planck Institute
for Astronomy in Heidelberg, Germany.

In addition to NASA’s partner centre in Spain,
other centres have sprung up, including Ger- 
many’s Institute of Planetary Research in Berlin, 
which is a cooperative alliance of about 200 sci- 
entists across 17 science organizations, with 
funding from the Helmholtz Alliance based in 
Bonn, which studies planetary evolution. Gerda 
Horneck, a former deputy director of the Ger- 
man Aerospace Centre's Institute of Aerospace 
Medicine in Cologne, says that there are now 
25 PhD candidates in the Helmholtz Space Life 
Sciences Research School, based in various loca- 
tions in Germany. 

For fledgling scientists in the field, the eco- 
nomic climate means that success can hinge on 
finding multiple funding sources. Julie Huber, a 
microbial oceanographer at the Marine Biologi- 
cal Laboratory in Woods Hole, Massachusetts, 
whose research focuses on life at deep hydro- 
thermal vents, says she applies for grants from 
NASA and the US National Science Foundation 
for oceanography programmes and occasion- 
ally from the US National Institutes of Health 
for microbiology research. She also applies to 
private foundations such as the Gordon and 
Betty Moore Foundation in California and the 
Alfred P. Sloan Foundation in New York. Huber 
says she considers herself lucky to have secured 
funding, given the budget woes of govern- 
ment agencies and the state of the foundations’ 
endowments. “It used to be you had to write one 
or two grants to support yourself for five years,” 
she says. “Now it’s four, five, six grants every
year, with no growth in the pool of money.” 

Other initiatives have helped to augment 
support for the field. The CAREX project from 
the European Union's Seventh Framework 
programme for research, which coordinated 
the study of life in extreme environments, 
wrapped up this year after spending €1.4 mil- 
lion (US$1.8 million). Its data on desert life, 
specifically microbial diversity and aspects of 
life in extreme environments, could help to 
inform surveys for life on Mars, although it is 
not labelled as astrobiology. The next round of 
European funding, Horizon 2020, will include 
support especially for astrobiology and space 
exploration, says Horneck. How much sup- 
port is still under discussion. Up to this point, 
various projects across partnered universities 
and institutes have received about €2 million 
over three years from the European Union for 
cooperative astrobiology research, the CAREX 
funding being one example. 


SPACE SCHOOL 

Few PhD programmes award astrobiology 
degrees; students typically earn their degree 
in a field such as analytical chemistry or 
microbiology but focus on astrobiology for their 
thesis. For example, the University of Washing- 
ton in Seattle confers a certificate in astrobiol- 
ogy for graduate-level research, but graduate 
students retain their home-department affilia- 
tion for their doctorate. Graduates with this cer- 
tificate have gone on to work as astrobiologists 
or in related fields at NASA or other institutes. 


Astrobiologists hope to continue to study stars 
such as 51 Ophiuchi (artist’s rendition pictured). 
In 2005, Stockholm University launched an
astrobiology graduate school for an initial five 
years to gauge interest in the field, a sort of 
interdisciplinary curriculum experiment. The 
university considered it a success, and depart- 
ments including astronomy and geology have 
transferred their experience into the newly cre- 
ated astrobiology centre. 

Sandra Siljeström’s career exemplifies
the potential myriad roles an astrobiologist
can have. A graduate from the Stockholm
programme who trained as a geochemist,
Siljeström is now a research scientist at the government-run
SP Technical Research Institute of
Sweden in Borås, where she works on projects
for the Swedish space board, developing meth- 
ods to examine small amounts of organic com- 
pounds using time-of-flight spectroscopy. She is 
also helping to develop the Mars Organic Mol- 
ecule Analyser, an instrument to be carried on 
the ExoMars rover. Her position also includes 
contract work for industry, examining organic 
compounds in rocks and other materials. 


MORE BODIES 

Horneck advises undergraduates and master’s
students to get a solid grounding in one of the
many applicable sciences before moving on to
astrobiology-related research.

However, Malcolm Walter, director of the 
Australian Centre for Astrobiology at the Uni- 
versity of New South Wales, advises students to 
start taking astrobiology courses at the under- 
graduate level if they are available. “The skills 
are generic and can be applied to many career 
paths. Being interdisciplinary, thinking in an 
interdisciplinary fashion leads to many career 
opportunities,” he says. “I look for people who 
are willing to take this risk. In my experience, 
that’s an extraordinarily hard thing to do.”

Walter says two of his PhD students recently
ended up working for NASA: one as a classical
geologist at the Jet Propulsion Laboratory in
Pasadena, California, and the other as a mineral 
spectroscopist, mapping mineralogy on the sur- 
face of Mars. Other students have gone on to 
work for biotechnology firms or the booming 
Australian mining industry. 

Horneck worries that talented graduates will 
leave the field because few jobs exist outside 
NASA, ESA and academia, and even there, 
positions are limited. Many projects are suited 
to PhD theses, but not to full-time jobs. Not 
enough opportunities exist for bright doctorates 
to continue their research. “This is something 
that needs attention,” she says. Even so, student 
interest portends good things for the field. “Ten
years ago, there were no courses. Now it’s a field
very much taken up by students,” Horneck says.
Despite the challenges, she foresees a burgeoning
field, with more discoveries on the horizon.


Naomi Lubick is a freelance writer based in 
Stockholm, Sweden. 
TURNING POINT
Marc Modat 


A postdoc in non-linear medical-image 
registration at University College London, 
Marc Modat has created imaging software 
that has helped him to co-author more 
than 60 publications. In February, the 
software landed him the university's Young 
Investigator in Neuroimaging Techniques 
Award for 2011. 


You hadn’t planned to do a PhD. How did you 

end up doing one? 

Towards the end of my undergraduate degree 
in bioengineering at the University of Creteil 
in Paris, I did a medical-imaging internship 
at the Commonwealth Scientific and Indus- 
trial Research Organisation near Brisbane, 
Australia, with Sebastien Ourselin, who was 
working on how to compare the brain scans 
of patients. I had never considered doing 
a PhD, but he was moving to University 
College London and he asked if I wanted to 
join his group. This turned out to be one of 
the best career decisions I’ve made. 


Why did you pursue neuroimaging? 

I didn't have strong feelings about medical 
imaging, I just met good people at a good 
time, and I enjoyed working on the algo- 
rithms needed to align and compare the 
scanned images of patients. This is called 
medical-image registration and is a crucial 
piece of the image-processing pipeline. For 
example, if you want to compare the brain 
scans of two different patients or the scans of 
the same patient over time, you need to have 
a way to quantify any changes. 


What was so popular about the software 
code you designed? 

Computation time was a big challenge in 
image processing when I started. We were 
working with large cohorts of patients, and 
it could take hours to process images. I found 
a way to make the technique up to 200 times 
faster but without compromising accuracy, 
and I put the code online for other scientists. 
I'm not saying that my code is better than the 
others available online, but it may be faster, 
which got people’s attention. I received a 
great deal of peer recognition, invitations 
to be part of collaborations and the young 
investigator award because of the code. 


Were there objections to publishing the code? 
Not really. Not everyone's supervisor would 
let you do that — some would prefer to main- 
tain their competitive advantage — but we 
decided to publish the code because we knew 


that it would increase our visibility. It did. I 
receive e-mails from people all over the world 
saying that they are using it or suggesting 
modifications. We are still working on it daily 
and adding more features to ensure accuracy 
and increase its speed. 


Are all your publications a result of 
developing the code? 

Most are. As a PhD student, I co-authored 
publications with other research teams 
because our code and imaging expertise were 
used to help with their work. I am part of a
large laboratory, so I have been involved in a
lot of different clinical studies, but now I’m
trying to find a balance between working on 
my projects and working as a collaborator. 


Do you think your career interests will 

dictate the development of the code? 

Going forward, I’m not sure the code will be 
linked only to me; many people are contrib- 
uting to it by suggesting changes or additional 
features. I would like to be involved in eve- 
rything related to the code, but that depends 
on how it grows. We are thinking of merging 
the medical-image registration software, the 
software that reconstructs images, and the 
simulation programs and visualization tools 
into one package, which could have a big 
effect on the field. 


Why is this award important for you? 
Supervisors nominate their students every 
year for this award. Typically, it goes to a 
clinician or a biologist. When I received the 
award, it made me think that my work is as 
valued as that of the clinicians — which is 
nice because I really enjoy what I do.


INTERVIEW BY VIRGINIA GEWIN 
INTERVIEWS
Narcissists excel 


Job hunters who hope to do well during 
interviews should talk about themselves 
in glowing terms, says a study in press
in the Journal of Applied Social Psychology. 
The authors found that behaviours 

that would be considered narcissistic 

in other settings were rated highly in 
simulated job interviews. “This is one 
setting where humility is not valuable,” 
says study co-author Peter Harms, 
assistant professor of management at 

the University of Nebraska-Lincoln. He 
encourages interviewees to speak a lot. 
“Don’t give short answers,” he says. Those
who spoke quickly, smiled, gestured and 
complimented others often, and who 
spent time on grooming were rated highly. 


FUNDING 
Global-health research 


A fellowship programme will award 
US$20.3 million over 5 years to 400 
early-career scientists in the United 
States and in low- and middle-income 
countries to build the next generation 

of global-health researchers. Eighteen 
centres at the US National Institutes of 
Health (NIH) in Bethesda, Maryland, 
including the Fogarty International 
Center, are supporting the fellowships.
Participants will be selected by their 
institution to conduct research at one 

of about 80 sites globally. Trainees 

will study issues such as HIV/AIDS, 
tuberculosis, malaria, maternal and child 
health, cancer, cardiovascular disease 
and diabetes. Postdocs will make up 80% 
of new fellows and graduate students the 
remaining 20%. 


INVESTMENT 
UK drug development 


A £250-million (US$397-million) 
investment in venture capital for 
translational research in the United 
Kingdom could create jobs. The European 
Investment Fund in Luxembourg City 
and Cancer Research Technology (CRT), 
the commercial arm of the London-based 
charity Cancer Research UK, are jointly 
investing £50 million in the development 
of potential cancer drugs. A separate 
£200-million initiative by the Wellcome 
Trust in London will support early-stage 
biotechnology firms. Keith Blundy, chief 
executive of CRT, says that experts in drug 
development should find opportunities
as the funding helps to take more drug
candidates through to clinical trials.

FUTURES SCIENCE FICTION


BY KEN LIU 


Ted and Kathy stared at the chaotic
scene through the bars of the
cage. A large, male macaque
monkey about two feet tall
screeched and lifted the typewriter
— a 1953, lime green
Olivetti Lettera 22 — over his
head. He stood still for a second
like a weight-lifter, and then
threw the machine hard against the
ground. It clattered on the dirt floor,
keys and platen jangling, and came to
rest with the sequence “jl,dykb nvcxliuear”
on the sheet of paper.

Kathy's hands covered her mouth. 
She blew out a breath. “At least we 
now have our first keystrokes.” 

Ted just shook his head. 

Two smaller males approached the 
typewriter. One jumped up and down on 
the keys: “cx,juoun2 ep89 xad1 *” The other 
watched, and then decided to defecate into 
the curved bowl formed by the type bars. 

“Well, Professor Emroche isn’t going to
want that typewriter back,” Kathy said, after
recovering from the initial shock. Then she 
started to laugh. The monkeys stopped to 
stare at her, which only made her laugh 
harder. 

Ted shook his head again. 


The ‘Monkey Shakespeare’ project was an 
interdisciplinary collaboration between the 
humanities and the science departments. But 
after a short clip of the monkeys pooping into 
the typewriter was circulated online, every- 
one began to distance themselves from it. 

“We don't really think there’s much science 
in a project like this,” said Professor Kun of 
the Department of Computer Science. “That 
old chestnut about an infinite number of 
monkeys banging on an infinite number 
of typewriters is just a thought experiment. 
You should really ask the literature people 
what they’re trying to accomplish with this.” 

“This is obviously not how we envisage 
the future study of literature,” said Professor
Emroche of the Department of American 
and English Language and Literature. “We 
already have the works of Shakespeare, so 
why would we want monkeys reproducing 
it by chance? You should really talk to the 
biology people about what they are trying 
to get out of it.” 

In the end, Ted and Kathy, undergraduate 
assistants who had yet to declare their majors 
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and who had signed on to feed the monkeys, 
became the only names attached to the pro- 
ject. The students felt like orphans. No one 
wanted to claim responsibility for them. 

They decided to treat the project as a piece 
of performance art. 


After a while, the monkeys stopped abus- 
ing the typewriter. They mainly left it alone 
during the day, like a toy they found boring. 
But once in a while, one or a group of them 
would come by and bang on the keys until 
they grew tired of it again. Ted and Kathy 
stopped by several times a day to swap out 
filled pages for fresh ones. 

Kathy flipped through the stack of type- 
script they had collected, examining each 
like a puzzle. 

“There's a word!” she said. 

Ted looked at where she was point- 
ing. Amid a full page of jumbled let- 
ters, a sequence of four stood out like a 
bright diamond: “734q9u8o0pfuion wise 
djk;we897d78” 

Ted sighed. “It took the monkeys 5 days 
to produce 25 pages, and we have one word. 
At this rate, we aren't going to have much to
show by the end of the semester. Not much of
an art project?”

“You're missing the point about art,” Kathy
said. She threw the papers up in the air and
watched them drift to the ground like a 
flock of pigeons. “It’s not about what you 
have to show, but how you interpret what 
you have.” 
“How would you interpret this?” 
“It shows just how rare order is
in this random universe we live 
in. It shows how art is precious 
and miraculous. It shows the true 
measure of Shakespeare's genius.” 
Ted laughed. “Yeah, that B. S. 
sounds pretty good.” 
“It’s the viewer who gives meaning to
art,” Kathy said, laughing as well. “Just
like it’s the reader who gives meaning 
to the book.” 




Wflq rushed into the Eqqlb, gesticulat- 
ing wildly. 

She waved a sheaf of papers in front of 
7*uLw, who asked her calmly what in the 
world was the matter with her. 

Wflq was too excited to speak coher- 
ently. It took a while before 7*uLw finally 
understood that her experiment had finally 
yielded some results. In one of the infinite 
number of universes that she had been 
observing, the native creatures had finally 
produced something worthy of notice. 

Although the natives who dominated that 
world seemed to bang on keyed machines 
randomly all the time, this time they had 
allowed some other creatures — deemed 
inferior by them for some reason — to 
use one of their machines. These ‘lesser’ 
creatures immediately produced the most 
beautiful poetry that the universe had ever 
created. Out of the babbling, random chaos 
that made up almost all the symbolic output 
of that world, this new string stood out like a 
clear voice in the wilderness. 

But tragically, the natives of that universe 
did not seem to understand what they had. 
Wflq had to rescue a great piece of art.

Here’s how the masterpiece began: 
“jl,dykb nvcxliuear cx,juoun2 ep89 xad1 * ...”

As they read, Wflq and 7*uLw fell in 
rapture. The beauty of the language over- 
whelmed them. 

Translated into our language, the book 
Wflq produced started this way: 
Boatswain! 

Here, master: what cheer?


Ken Liu lives and writes in Massachusetts. 
You can read more of his fiction at 
http://kenliu.name/stories. 
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A Xanthomonas uridine 5'-monophosphate 
transferase inhibits plant immune kinases 


Feng Feng, Fan Yang, Wei Rong, Xiaogang Wu, Jie Zhang, She Chen, Chaozu He & Jian-Min Zhou


Plant innate immunity is activated on the detection of pathogen- 
associated molecular patterns (PAMPs) at the cell surface, or of 
pathogen effector proteins inside the plant cell. Together,
PAMP-triggered immunity and effector-triggered immunity constitute
powerful defences against various phytopathogens. Pathogenic 
bacteria inject a variety of effector proteins into the host cell to assist 
infection or propagation. A number of effector proteins have been 
shown to inhibit plant immunity, but the biochemical basis
remains unknown for the vast majority of these effectors. Here we 
show that the Xanthomonas campestris pathovar campestris 
type III effector AvrAC enhances virulence and inhibits plant 
immunity by specifically targeting Arabidopsis BIK1 and RIPK, 
two receptor-like cytoplasmic kinases known to mediate immune 
signalling. AvrAC is a uridylyl transferase that adds uridine
5'-monophosphate to, and conceals, conserved phosphorylation sites
in the activation loop of BIK1 and RIPK, reducing their kinase 
activity and consequently inhibiting downstream signalling. 

Xanthomonas campestris pathovar campestris (Xcc) is a causal agent 
of black rot diseases on numerous crucifer plants such as Brassica and 
Arabidopsis. AvrAC exists in all three sequenced strains of Xcc and a 
newly sequenced Xanthomonas campestris pathovar raphani strain, 
but not other Xanthomonas species (http://www.xanthomonas.org/t3e.html).
In Xcc strain 8004 (Xcc8004), AvrAC is recognized by
Arabidopsis plants to trigger a weak effector-triggered immunity 
(ETI) in vascular, but not mesophyll, tissues. To determine if AvrAC
has a role in virulence, we generated an avrAC knockout strain
(Xcc8004ΔavrAC) and performed competitive index assays on cabbage
plants (Methods). The competitive index values (Xcc8004ΔavrAC to
Xcc8004 ratios) were 0.5-0.3 in 3/5 cultivars tested in multiple experi- 
ments (Fig. 1a), demonstrating a role of AvrAC in virulence on these 
plants. Well known PAMPs, including the bacterial flagellar peptide 
flg22, bacterial elongation factor (EF-Tu) peptide elf18, and fungal cell 
wall component chitin, are recognized by pattern recognition
receptors (PRRs) FLS2, EFR and CERK1, respectively, to induce the
expression of a PAMP-responsive reporter gene FRK1::LUC. The pres- 
ence of the AvrAC transgene largely inhibited FRK1::LUC induction in 
protoplasts (Fig. 1b and Supplementary Fig. 1), indicating that AvrAC 
is a potent inhibitor of the PAMP-triggered immunity (PTI) signalling 
pathway. 
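
As a rough illustration of how a competitive index of the kind reported above might be computed, the following Python sketch takes the ratio of mutant to wild-type bacteria recovered from a leaf and normalizes it by the ratio in the inoculum. The normalization step, the function name and the colony counts are assumptions made for illustration only; the text defines the index simply as the Xcc8004ΔavrAC to Xcc8004 ratio, and the actual protocol is described in the Methods.

```python
from statistics import mean, stdev


def competitive_index(mutant_out, wildtype_out, mutant_in=1.0, wildtype_in=1.0):
    """Mutant-to-wild-type ratio recovered, normalized by the inoculum ratio.

    With the default 1:1 inoculum this reduces to the plain output ratio.
    """
    if min(mutant_out, wildtype_out, mutant_in, wildtype_in) <= 0:
        raise ValueError("all bacterial counts must be positive")
    return (mutant_out / wildtype_out) / (mutant_in / wildtype_in)


# Hypothetical colony counts (mutant, wild type) from six co-inoculated leaves.
replicates = [
    (3.1e6, 8.0e6), (2.6e6, 7.5e6), (4.0e6, 9.1e6),
    (2.9e6, 8.8e6), (3.5e6, 7.9e6), (3.3e6, 8.2e6),
]
values = [competitive_index(m, w) for m, w in replicates]
print(f"competitive index: {mean(values):.2f} +/- {stdev(values):.2f} (n = {len(values)})")
```

An index below 1 means the mutant was outcompeted by the wild type in the mixed infection, which is how the values of 0.5-0.3 quoted above are read.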

Two AvrAC transgenic lines were generated to test if AvrAC 
impedes mesophyll resistance to Xcc8004, the nonpathogenic mutant
Xcc8004ΔhrcV and the Pseudomonas syringae pathovar tomato
(Pst) ΔhrcC mutant. In all three cases, the transgenic lines
supported 0.7-1.1 log more bacterial growth than did the wild-type 
plants (Fig. 1c, d). Thus AvrAC can markedly compromise PTI res- 
istance to these bacteria. The AvrAC transgenic lines were also inocu- 
lated with Pst strains carrying effector genes avrB, avrRpt2 or avrPphB, 
which elicit ETI resistance specified by the cytoplasmic immune receptors
RPM1, RPS2 and RPS5, respectively. The transgenic lines supported
0.9-1.0 log more Pst(avrB) bacterial growth than did wild-type
plants (Fig. 1e). In contrast, the transgenic lines supported less than 0.3
log increase in bacterial growth when Pst(avrRpt2), Pst(avrPphB) and
Pst bacteria lacking an avr gene were used (Fig. 1e and Supplementary
Fig. 2). Thus, although the AvrAC transgene compromised RPM1- 
dependent ETI, it did not appear to affect RPS2- and RPS5-dependent 
ETI. To determine further if bacterially delivered AvrAC similarly 
interferes with RPM1 activation, we infiltrated Arabidopsis plants with
Xcc8004 before Pst(avrB) inoculation. Whereas the control plants
inoculated with Pst(avrB) alone developed a strong hypersensitive
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Figure 1 | AvrAC inhibits plant immunity. a, avrAC promotes Xcc bacterial 
virulence on cabbage plants. Competitive index value represents an average 
ratio of Xcc8004ΔavrAC to Xcc8004 bacterial growth on the indicated cabbage
cultivars (mean ± s.d.; n = 6; **P < 0.01, Student’s t-test). b, Transient
expression of AvrAC-Flag in protoplasts inhibits PAMP-induced FRK1::LUC
transcription (mean ± s.d.; n = 3). c-e, The AvrAC transgene compromises
PTI and RPM1-specified ETI in plants. Wild-type (WT; Col-0) and AvrAC
transgenic lines pre-treated with oestradiol were infiltrated with Xcc8004 and
Xcc8004ΔhrcV (c), PstΔhrcC (d), or Pst or Pst(avrB) (e). Bacterial population in
the leaf was measured at the indicated times (mean ± s.d.; n = 6; **P < 0.01,
Student’s t-test). f, Bacterially delivered AvrAC inhibits the RPM1-specified
hypersensitive response (HR). Wild-type Arabidopsis plants were infiltrated
with the indicated Xcc strains 1 day before infiltration of Pst or Pst(avrB), and
development of the hypersensitive response was photographed 6 h later. The
ratio of leaves showing a hypersensitive response to the total number of injected 
leaves is shown on top. 
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response, the plants pre-treated with Xcc8004 developed no hyper- 
sensitive response (Fig. 1f), indicating a strong interference of RPM1 
activation by Xcc8004. The Xcc8004ΔavrAC mutant strain was less
capable of preventing hypersensitive response development (Fig. 1f 
and Supplementary Fig. 3), indicating that AvrAC interferes with 
RPM1-mediated ETI when delivered from Xcc. 
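
The growth differences above are quoted in log units (for example 0.7-1.1 log more bacteria in the transgenic lines). The short Python sketch below, which assumes these are base-10 logarithms of bacterial counts, converts such a difference into a fold change; the function name is illustrative and not taken from the paper.

```python
def log_difference_to_fold(delta_log10):
    """Convert a difference in log10 bacterial counts into a fold change."""
    return 10 ** delta_log10


# Differences quoted in the text, in log10 units.
for delta in (0.3, 0.7, 0.9, 1.0, 1.1):
    print(f"{delta:.1f} log = {log_difference_to_fold(delta):.1f}-fold")
```

On this reading, a difference of 1.0 log is a tenfold increase in bacterial numbers, whereas a difference below 0.3 log is less than a twofold change.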

To identify host targets and the mechanism by which AvrAC inhi- 
bits PTI, we systematically investigated various molecular events of 
PTI signalling pathways. flg22 perception triggers H2O2 production
and MAP kinase (MAPK) activation in wild-type plants within several 
minutes; both events were completely abolished or diminished in 
AvrAC transgenic plants (Fig. 2a, b). However, AvrAC was unable to 
inhibit MAPK activation triggered by the constitutively active 
MKK5DD (ref. 10; Supplementary Fig. 4). These results suggested that
AvrAC acts upstream of MAPK cascades and H2O2 production to
inhibit PTI signalling. 

PAMP detection rapidly induces phosphorylation of BIK1 (refs 6, 
7), which acts immediately downstream of FLS2, EFR and CERK1 (ref. 
7). The expression of AvrAC in protoplasts prevented the flg22- 
induced BIK1 phosphorylation, as indicated by a slow migrating form 
on SDS-polyacrylamide gel electrophoresis (SDS-PAGE; Fig. 2c). 
Instead, BIK1 uniformly migrated between the unphosphorylated 
and phosphorylated forms independent of flg22 treatment. 

Upstream of BIK1 phosphorylation is the flg22-induced association 
between FLS2 and a receptor-like kinase called BAK1, forming an 
active receptor complex. This interaction occurred normally in
protoplasts irrespective of the presence or absence of AvrAC (Fig. 2d). 

The results described above suggested that AvrAC directly targets
BIK1, FLS2 or BAK1. Co-immunoprecipitation assays showed that 
AvrAC can interact with BIK1, but not FLS2 or BAK1 (Fig. 2e). 
Glutathione S-transferase (GST) pull-down assays indicated that this 
interaction is direct (Fig. 2f). RIPK, an RLCK related to BIK1, can 
phosphorylate the RPM1-interacting protein RIN4 at T166 (ref. 8), 
and the phosphorylated RIN4 is responsible for triggering RPM1-specified
ETI. AvrAC also interacted with RIPK in vitro and in
protoplasts (Figs 2f and 4b). We conclude that BIK1 and RIPK are
specific targets of AvrAC, a conclusion consistent with its inhibition of 
PTI and the RPM1-specific ETI. 

The AvrAC-induced electrophoresis mobility shift of BIK1 was 
resistant to phosphatase treatment (Supplementary Fig. 5a). Co- 
expression of BIK1 and AvrAC in Escherichia coli detected the same 
phosphatase-resistant, slow migrating form of BIK1 (Supplementary
Fig. 5b, c), suggesting an AvrAC-induced BIK1 modification that is not 
caused by protein phosphorylation. 

AvrAC contains a filamentation induced by cAMP (Fic) domain in
the carboxy terminus. More than 4,300 Fic-domain-containing 
proteins have been found in bacteria and animals, mostly with 
unknown biochemical function. Two Fic-domain-containing,
animal bacterial pathogen effectors Vibrio VopS and Histophilus 
IbpA catalyse adenylylation (AMPylation) on threonine or tyrosine 
residues of Rho family GTPases to disrupt cytoskeleton in the host 
cell. Another bacterial effector protein, Legionella pneumophila
DrrA (also known as SidM) uses a glutamine synthetase adenylyl 
transferase domain to adenylylate another host GTPase called 
Rab1b. However, in vitro labelling and mass spectrometric assays
with AvrAC and BIK1 recombinant proteins failed to detect any
AMPylation on BIK1. Surprisingly, mass spectrometry of the AvrAC 
recombinant protein (expressed alone in E. coli) revealed multiple 
modified protein components with serial additions of 306 Da, indi- 
cative of uridine 5'-monophosphate (UMP) modifications (Sup- 
plementary Fig. 6a). Tandem mass spectrometry (MS/MS) revealed 
multiple serine and threonine residues in AvrAC containing mono- 
UMP modifications (Supplementary Fig. 6b). The results suggested 
that AvrAC is a UMP transferase. In vitro assays with α-32P-UTP
showed that both BIK1 and RIPK were uridylylated only when incu- 
bated with AvrAC (Fig. 3a). AvrAC also labelled itself in this assay, 
explaining the UMP modifications on AvrAC described above. PBL1, 
a homologue of BIK1 that acts additively with BIK1 in PTI signalling,
was also modified by AvrAC (Supplementary Fig. 7). Interestingly, 
another RLCK VII subfamily member, PBS1, was not modified under 
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Figure 2 | AvrAC targets BIK1 and inhibits early PTI signalling events. 
a, Flg22-induced H2O2 production in wild-type and AvrAC transgenic plants
(mean ± s.d.; n = 8). b, Flg22-induced MAPK phosphorylation in wild-type
and AvrAC transgenic plants. The identities of phosphorylated MAPKs are 
labelled as described (Methods). Equal loading is indicated by immunoblot 
with anti-MPK3 antibodies or ponceau staining of Rubisco protein. c, Flg22- 
induced BIK1-HA phosphorylation in protoplasts in the presence or absence of 
AvrAC-Flag. d, Flg22-induced FLS2-BAK1 association in protoplasts in the
presence of AvrAC as determined by co-immunoprecipitation assays. The 
minor differences of BAK1-HA levels resulted from sample processing. 

e, AvrAC interacts with BIK1, but not FLS2 and BAK1, in protoplasts. BIK1- 
HA, BAK1-HA and FLS2-HA were co-expressed with AvrAC-Flag in 
protoplasts and subjected to co-immunoprecipitation. f, AvrAC interacts with 
BIK1 and RIPK in vitro in a GST pull-down assay. The bound proteins were 
analysed by anti-His immunoblot and the membrane stained with Coomassie 
blue (CBB). IB, immunoblot; IP, immunoprecipitation. 
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Figure 3 | AvrAC uridylylates BIK1 and RIPK. a, AvrAC uridylylates BIK1 
and RIPK, but not FLS2 and BAK1, in vitro. GST-AvrAC was incubated with
His-BIK1, His-RIPK, or His-tagged kinase domains (KD) of FLS2 or BAK1, in
the presence of α-32P-UTP. The resulting products were subjected to
SDS-PAGE and autoradiography. b, c, AvrAC uridylylates conserved serine and
threonine residues in the activation loop of BIK1 and RIPK in the plant cell. 
BIK1-Flag (b) or RIPK-Flag (c) was co-expressed with AvrAC-HA in 
protoplasts, affinity purified, and subjected to MS/MS spectrometric analyses. 
The b- and y- ions are marked in the spectrum and illustrated along the peptide 
sequence above the spectrum. This unambiguously identified BIK1 Ser 236 and 
Thr 237, and RIPK Ser 251 and Thr 252 as UMP-modified residues. A 


the same condition. The same assay failed to label the kinase domain of 
FLS2 and BAK1, indicating that certain members of the RLCK VII 
subfamily are specific substrates of AvrAC (Fig. 3a). MS/MS analysis 
on the recombinant BIK1 protein that had been co-expressed with 
AvrAC uncovered a tryptic peptide (DGPMGDLSYVSTR) derived 
from the activation loop containing UMP at Ser 236 and Thr 237 
(Supplementary Fig. 8a). Likewise, RIPK that had been co-expressed 
with AvrAC also produced a tryptic peptide (DGPEGDDTHVSTR) 
located in the activation loop in which Ser 251 and Thr 252 were UMP 
modified (Supplementary Fig. 8b). BIK1 and RIPK proteins that had 
been co-expressed with AvrAC in protoplasts also contained the same 
UMP modifications (Fig. 3b, c), indicating that AvrAC can uridylylate 
BIK1 and RIPK in the plant cell. The vast majority of the BIK1 and 
RIPK activation loop peptides were uridylylated (Supplementary 
Fig. 9). Together these results unequivocally demonstrated that 
AvrAC is a potent enzyme that uridylylates BIK1 and RIPK both 
in vitro and in plant cells. 
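
The 306 Da mass additions described above can be checked with simple arithmetic: attaching a nucleoside monophosphate to a hydroxyl side chain adds the mass of the nucleotide minus one water. The Python sketch below works this out for UMP and, for comparison, AMP, using standard monoisotopic atomic masses. The function names and dictionary layout are illustrative assumptions and are not part of the analysis pipeline used in the paper.

```python
# Monoisotopic atomic masses in Da.
ATOMIC_MASS = {
    "C": 12.0,
    "H": 1.00782503,
    "N": 14.00307401,
    "O": 15.99491462,
    "P": 30.97376200,
}

# Elemental compositions of the free nucleotides and of water.
UMP = {"C": 9, "H": 13, "N": 2, "O": 9, "P": 1}
AMP = {"C": 10, "H": 14, "N": 5, "O": 7, "P": 1}
WATER = {"H": 2, "O": 1}


def monoisotopic_mass(composition):
    """Sum the atomic masses for an elemental composition."""
    return sum(ATOMIC_MASS[element] * count for element, count in composition.items())


def adduct_shift(nucleotide):
    """Mass added to a Ser/Thr/Tyr residue: nucleotide minus one water."""
    return monoisotopic_mass(nucleotide) - monoisotopic_mass(WATER)


for name, composition in (("UMP", UMP), ("AMP", AMP)):
    print(f"{name}: free nucleotide {monoisotopic_mass(composition):.3f} Da, "
          f"adduct shift {adduct_shift(composition):.3f} Da")
```

The UMP adduct shift comes out at about 306 Da and the free nucleotide at about 324 Da, while an AMP adduct would add about 329 Da, so the two modifications are readily distinguished by mass.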

ATP is the preferred nucleotide substrate for IbpA and VopS to 
modify Rho GTPases in vitro, although GTP, UTP and CTP can also
be used. Whether nucleotides other than ATP are used by these effec- 
tors to modify Rho GTPases in host cells remains unknown. AvrAC 
failed to AMPylate BIK1, and we observed no AMPylation on any 
peptides of BIK1, RIPK or AvrAC expressed in E. coli or plant cells. 
Another Fic-domain protein, AnkX, from Legionella pneumophila was 
recently shown to covalently attach a phosphocholine moiety to the 
Rho GTPase Rab1 (ref. 20). The commonality shared by the three
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represents neutral loss of an UMP group with nominal mass of 324 Da. 

d, Alignment of the activation loop of selected RLCK VII kinases. The UMP
modification sites are highlighted in red. e, BIK1 Ser 236 and Thr 237 are 
required for uridylylation by AvrAC. Various BIK1-HA mutants carrying the 
indicated amino acid substitutions were co-expressed with AvrAC-Flag and 
analysed by immunoblot. Note the increased BIK1 protein level in the presence 
of AvrAC, suggesting an AvrAC-induced stabilization of BIK1 protein. 

f, Modification by AvrAC inhibits BIK1 and RIPK kinase activity. His-BIK1 or 
His-RIPK co-expressed with GST-AvrAC or GST in E. coli was affinity
purified and subjected to autophosphorylation assay. 


modifications suggests that Fic-domain proteins are phosphotrans- 
ferases with different substrates. 

BIK1 Ser 236 and Thr 237 and RIPK Ser 251 and Thr 252 are highly 
conserved among RLCK VII subfamily members (Fig. 3d). 
Phosphorylation of BIK1 Ser 236 and Thr 237 has a crucial role in 
PTI signalling. AvrAC failed to induce the BIK1(S236A) and
BIK1(T237A) mutant protein migration in protoplasts (Fig. 3e), indi- 
cating that these residues are required for UMP modification. A 
similar result was obtained with a BIK1(S233A,Y234A,S236A,T237A)
quadruple mutant (Supplementary Fig. 10). In contrast, mutations in
Lys 105, an invariant residue required for ATP binding, and Ser 233 
and Tyr 234, two less conserved amino acid residues in the activation 
loop, did not affect the AvrAC-induced migration (Supplementary Fig. 
10 and Fig. 3e). RIPK and BIK1 that had been UMP modified by 
AvrAC in E. coli displayed reduced autophosphorylation (Fig. 3f). 
Taken together, these results demonstrated that AvrAC inhibits 
BIK1 and RIPK kinase activities by directly uridylylating the conserved 
phosphorylation sites in the activation loop, thereby preventing their
phosphorylation and subsequent signal transduction. 

AvrAC contains an N-terminal domain (NTD) of unknown func- 
tion, a leucine-rich-repeat domain (LRR), and a C-terminal region 
(CTR) harbouring the Fic domain (Fig. 4a). The homology of the 
AvrAC Fic domain with other Fic domains mainly resides in the 
C-terminal subdomain (Supplementary Fig. 11), in which the highly 
conserved HPFx(D/E)GN(G/K)R motif is responsible for nucleotide 
binding and catalysis. In particular, the invariant His residue
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Figure 4| AvrAC uridylyl transferase activity and BIK1 are required for 
AvrAC virulence in plants. a, Schematic diagram of AvrAC mutant constructs 
and their interaction with BIK1 and RIPK. The ability of these constructs to 
interact with BIK1 and RIPK is summarized on the right. b, The AvrAC NTD 
and LRR, but not CTR, interacted with RIPK in co-immunoprecipitation assays. 
c, In vitro uridylylation of RIPK by the indicated GST-AvrAC recombinant 
proteins. d, Inhibition of flg22-induced FRK1::LUC expression by AvrAC 
mutants (mean ± s.d.; n = 3). e, The LRR domain and Fic motif are required for
AvrAC virulence in cabbage plants. Competitive index assays were carried out
on cabbage cultivar Zhonggan15 with Xcc8004ΔavrAC complemented with the
indicated constructs (mean ± s.d.; n = 6; **P < 0.01, Student’s t-test). f, AvrAC


corresponding to AvrAC His 469 has a key role in the catalysis. AvrAC 
constructs lacking the Fic catalytic motif (ΔFic) or CTR (ΔCTR) were
fully capable of interacting with BIK1 and RIPK, whereas the AvrAC 
CTR alone was completely unable to interact with BIK1 and RIPK 
(Fig. 4b and Supplementary Fig. 12). Furthermore, the AvrAC NTD 
alone, the construct lacking NTD, and the construct lacking LRR were 
all able to interact with BIK1, indicating that NTD and LRR constitute
the kinase-binding region. Removing the kinase-binding region or
CTR, or substitution of His 469 with Ala, rendered AvrAC completely
inactive both in RIPK uridylylation in vitro (Fig. 4c) and in inducing a BIK1
mobility shift in protoplasts (Supplementary Figs 12 and 13). These
results are consistent with the notion that the AvrAC Fic domain is 
responsible for catalysis, whereas NTD and LRR allow docking of 
substrate proteins. 

We next determined the role of individual AvrAC domains in the 
inhibition of PTI signalling and virulence function. All the AvrAC 
deletion constructs were unable to inhibit FRK1::LUC expression, 
whereas the AvrAC(H469A) mutant was partially affected (Fig. 4d, 
Supplementary Figs 14 and 15). Only the wild-type, but not the 
truncated versions of avrAC, was able to promote Xcc8004ΔavrAC
bacterial growth on Zhonggan15 plants (Fig. 4e). Together these 
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contributes to Xcc virulence in Arabidopsis plants in a BIK1-dependent manner. 
XccB186 and XccB186ΔavrAC were co-infiltrated into Arabidopsis plants of the
indicated genotypes for competitive index assay (mean ± s.d.; n = 6;

**P = 0.01, Student’s t-test). g, A model for AvrAC-catalysed uridylylation and 
inhibition of plant immunity. In normal plants (left), BIK1 is phosphorylated on 
stimulation by upstream immune receptor kinase complexes, consisting of a 
PRR (such as FLS2 and EFR) and BAK1 (refs 6, 7), whereas RIPK is
phosphorylated upon stimulation by AvrB. In plants that are infected with Xcc
(right), AvrAC uridylylates the conserved serine and threonine residues in the 
activation loop of BIK1 and RIPK, thereby preventing the phosphorylation- 
dependent activation of these kinases and downstream signalling. 


results indicated that the ability of AvrAC to inhibit immune signalling 
and enhance virulence not only required its binding to, but also UMP 
modification of, the substrate proteins. 

Xcc8004 and Xcc8004ΔavrAC are weakly virulent on Arabidopsis
plants, preventing a test of AvrAC virulence function in Arabidopsis.
We found that Xcc strain B186, which is highly virulent on Arabidopsis
plants, contains AvrAC with a single amino acid substitution
(AvrAC(T358A)). We then generated an XccB186ΔavrAC mutant
and conducted competitive index assays in Arabidopsis mesophyll
tissues. XccB186ΔavrAC grew to ~60% of XccB186 in wild-type
Arabidopsis plants, and the difference was statistically significant
(Fig. 4f). In contrast, the two strains grew to nearly identical levels in
bik1 mutant plants. The bik1 mutant is known to accumulate high levels
of the defence hormone salicylate, which may potentially affect the
virulence assay. We therefore carried out the assays on the sid2 mutant,
which has greatly reduced salicylate production. XccB186ΔavrAC grew
to ~40% of XccB186 on sid2 plants (Fig. 4f). In contrast, the two strains 
grew to nearly identical levels in the bik1/sid2 double-mutant plants, 
again indicating that avrAC promotes XccB186 bacterial growth in a 
BIK1-dependent manner. Together these results demonstrated that 
BIK1 is a key target for AvrAC virulence activity. 
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In this study we demonstrated that AvrAC inhibits both PTI and 
RPM1-specified ETI by targeting two related RLCKs, BIK1 and RIPK, 
as illustrated in the model (Fig. 4g). BIK1 and PBL1 are not required for
PAMP-induced MAPK activation (Supplementary Fig. 16), suggesting 
that additional host targets account for the MAPK inhibition by 
AvrAC. Although Xcc is not known to trigger the RPM1-specific ETI 
in Arabidopsis, the Xcc bacterium carries XopAH (also known as 
AvrXccC), which is highly homologous to AvrB. It is formally possible
that XopAH triggers ETI specified by an unknown resistance protein in 
some plants through RIPK. Inactivation of RIPK by AvrAC may allow 
the Xcc bacterium to inhibit such a potential ETI resistance in plants. 
AvrAC is the only Fic-domain effector among all sequenced phyto- 
pathogenic bacteria. However, the Fic and Doc (death on curing) 
domains contain a core structure, designated Fido, which is shared 
by AvrB and XopAH. AvrB and XopAH do not carry the catalytic
His residue conserved in Fic and Doc proteins, and it remains to be 
determined whether they also possess phosphotransferase activities. 

AvrAC uridylylates and masks the conserved phosphorylation sites 
in the activation loop of BIK1 and RIPK, preventing the activation of 
these kinases and impairing plant immunity. Interestingly, a similar 
strategy has been used by the Yersinia effector YopJ, which selectively
acetylates serine and threonine residues in the activation loop of
human MEK6, thereby blocking the phosphorylation of the target
protein and signalling. Multiple P. syringae type III effectors are
known to target plant protein kinases and inhibit PTI signalling by
using diverse biochemical mechanisms. The work described here
illustrates a unique biochemical mechanism by which the Xcc bac- 
terium combats the plant innate immune system. 


METHODS SUMMARY 


In vitro uridylylation assay. Affinity-purified recombinant His-RIPK or His- 
BIK1 protein was incubated with GST-AvrAC in a uridylylation reaction buffer
supplied with α-32P-UTP. The products were separated on an SDS-PAGE gel and
visualized by autoradiography. 

Mapping UMP modification sites. Affinity-purified GST-AvrAC, His-BIK1, 
His-RIPK, BIK1-Flag and RIPK-Flag proteins were separated by SDS-PAGE. 
The corresponding protein bands were sliced and in-gel digested with trypsin 
overnight at 37 °C. The resulting peptides were extracted and analysed by nano- 
flow LC-MS/MS on a QSTAR XL mass spectrometer. Database searches were 
performed on a Mascot server with the following variable modifications: oxidation
on methionine, carbamidomethylation on cysteine, UMP modification on serine, 
threonine or tyrosine, and AMP modification on serine, threonine or tyrosine. 
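
To make the variable-modification search concrete, the Python sketch below computes monoisotopic masses for the BIK1 activation-loop peptide DGPMGDLSYVSTR with and without UMP on candidate residues. It is only an illustration of the mass arithmetic behind such a search and does not reproduce Mascot or its configuration; the function names and table layout are assumptions, and the mass shifts are standard values rather than figures quoted in the text.

```python
# Monoisotopic residue masses (Da) for the amino acids in the two
# activation-loop peptides reported in the text.
RESIDUE_MASS = {
    "D": 115.02694, "G": 57.02146, "P": 97.05276, "M": 131.04049,
    "L": 113.08406, "S": 87.03203, "Y": 163.06333, "V": 99.06841,
    "T": 101.04768, "R": 156.10111, "E": 129.04259, "H": 137.05891,
}
WATER = 18.01056

# Variable modifications: (mass shift in Da, residues that may carry it).
VARIABLE_MODS = {
    "UMP": (306.02530, "STY"),
    "AMP": (329.05252, "STY"),
    "Oxidation": (15.99491, "M"),
    "Carbamidomethyl": (57.02146, "C"),
}


def candidate_sites(peptide, mod):
    """Zero-based positions in the peptide that could carry the modification."""
    _, allowed = VARIABLE_MODS[mod]
    return [i for i, residue in enumerate(peptide) if residue in allowed]


def peptide_mass(peptide, mods=None):
    """Neutral monoisotopic mass; mods maps zero-based position to a mod name."""
    mass = WATER + sum(RESIDUE_MASS[residue] for residue in peptide)
    for index, mod in (mods or {}).items():
        shift, allowed = VARIABLE_MODS[mod]
        if peptide[index] not in allowed:
            raise ValueError(f"{mod} is not allowed on {peptide[index]}{index + 1}")
        mass += shift
    return mass


BIK1_PEPTIDE = "DGPMGDLSYVSTR"
unmodified = peptide_mass(BIK1_PEPTIDE)
# Positions 10 and 11 are the serine and threonine of the C-terminal "STR".
double_ump = peptide_mass(BIK1_PEPTIDE, {10: "UMP", 11: "UMP"})

print("candidate UMP sites:", candidate_sites(BIK1_PEPTIDE, "UMP"))
print(f"unmodified: {unmodified:.3f} Da, doubly uridylylated: {double_ump:.3f} Da")
```

The peptide has four residues that could in principle carry UMP, so a search engine has to weigh several modified forms against the fragment ions; the MS/MS spectra described above are what localize the modification to particular residues.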


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Plants and bacterial strains. Arabidopsis plants were grown in a growth room at 
23 °C and 70% relative humidity with a 10/14 h day/night light cycle for 5 weeks
before protoplast isolation or bacterial inoculation. Cabbage cultivars
Zhonggan15, Chungan45, Jingfeng1, Zhonggan12 and Zhonggan8 were grown in
greenhouses at 25 °C. The bacterial strains used in this study included Xcc8004
(ref. 9), Xcc8004ΔhrcV (ref. 29), Xcc strain B186 (XccB186) (ref. 23), Pst strain
DC3000, PstΔhrcC (ref. 30), Pst(avrB), Pst(avrRpt2), Pst(avrPphB) and
Agrobacterium GV3101. 

Constructs, generation of knockout strains and transgenic plants. To generate 
recombinant protein constructs, full-length AvrAC and truncated mutants were 
PCR-amplified from Xcc8004 genomic DNA and inserted between EcoRI and 
XhoI sites of pGEX-6p-1 (Amersham Biosciences) to generate GST-AvrAC,
GST-CTR (C-terminal region), GST-ΔCTR and GST-ΔFic. A
GST-AvrAC(H469A) mutant construct was generated by site-directed mutagenesis.
The coding sequences of Arabidopsis RIPK, BIK1 and PBL1 were amplified from 
cDNA and inserted into pET28a or pET28b (Novagen) to generate His-RIPK, 
His-BIK1 and His-PBL1. 

To generate constructs for protoplast transfection assay, RIPK, AvrAC and its 
derivatives were PCR-amplified and inserted between XhoI and Csp45I sites of 
pUC19-35S-Flag-RBS vector to generate RIPK-Flag, AvrAC-Flag, NTD-Flag,
ΔNTD-Flag, CTR-Flag, ΔCTR-Flag, ΔFic-Flag, ΔLRR-Flag and AvrAC(H469A)-Flag.
RIPK and AvrAC were cloned into KpnI and SalI sites of pUC19-35S-HA-RBS
to generate RIPK-HA and AvrAC-HA, respectively. BIK1-Flag, BIK1-HA and BIK1
mutants were described previously.

To generate avrAC knockout mutants in Xcc8004 and XccB186, an internal
sequence corresponding to nucleotides 30-500 of the avrAC coding region was
PCR-amplified from genomic DNA of Xcc8004 and cloned into the suicide
plasmid pK18mob. The recombinant plasmid was transferred into the wild-type
Xcc strains by triparental conjugation, and the single crossover mutants were
confirmed by PCR and designated Xcc8004ΔavrAC and XccB186ΔavrAC. To
complement the Xcc8004ΔavrAC mutant, the 400 bp upstream sequence of the
avrAC coding region was PCR-amplified from the genomic DNA of Xcc8004 and
ligated between the XbaI and HindIII sites of pLAFR6 (ref. 9), generating an
intermediate plasmid carrying the avrAC promoter sequence. Full-length avrAC,
avrACΔLRR or avrACΔFic fragments were PCR-amplified using the
pUC19-35S-Flag-RBS plasmids containing the corresponding avrAC fragments as
templates (see above) before being integrated into the intermediate plasmid. The
resulting constructs were introduced into the Xcc8004ΔavrAC strain by
triparental conjugation.

The avrAC coding region was PCR-amplified from Xcc8004 genomic DNA and
ligated into a modified pER8 vector. The resulting clone containing
AvrAC-Flag under the control of the oestrogen-inducible promoter was transformed into
Arabidopsis (Col-0) by Agrobacterium-mediated transformation according to
standard protocols. Transgenic plants were selected on Murashige and Skoog
(MS) plates containing hygromycin. Two independent transgenic lines were
selected for experiments. The transgenic plants were sprayed with 50 mM oestradiol
in a 0.01% Silwet L-77 solution for 12 h to induce AvrAC protein expression.

Dual reporter assay. AvrAC and its truncation mutants in the
pUC19-35S-Flag-RBS vector were co-transfected with FRK1::LUC (firefly luciferase) and
35S::RLUC (Renilla luciferase) into Arabidopsis protoplasts. The protoplasts were
incubated overnight under low light and then treated with 1 µM flg22, 1 µM elf18 or
200 ng ml−1 chitin (Sigma) for 3 h. Protein was then isolated, and LUC activity was
recorded by using the Dual-Luciferase Reporter system (Promega) according to the
manufacturer’s instructions.

Oxidative burst. Oestradiol-induced leaves were sliced into 1 mm strips and
incubated in 200 µl water in a 96-well plate for 12 h before the addition of 1 µM flg22 in
200 µl reaction buffer supplemented with 20 mM luminol and 1 µg horseradish
peroxidase (Sigma). Luminescence was recorded with a Luminometer
(Promega) for 20 min.

MAPK activity assay. Two-week-old wild-type and transgenic seedlings grown
on 1/2 MS medium were sprayed with 10 µM flg22 or water containing 0.02%
Silwet L-77 for 10 min and frozen in liquid nitrogen. The seedlings were
homogenized in an extraction buffer containing 50 mM HEPES-KOH (pH 7.5), 150 mM
KCl, 1 mM EDTA, 0.2% Triton X-100, 1 mM DTT, complete protease inhibitors
(Roche) and phosphatase inhibitors (Fisher). The protein concentration was
determined using a Bio-Rad Bradford protein assay kit, and equal amounts of
total protein were electrophoresed on 10% SDS-PAGE. An anti-pERK antibody
(no. 4370S, Cell Signaling) was used to determine the phosphorylation state of
MPK3, MPK4 and MPK6 in an immunoblot. A duplicate blot was reacted with
anti-MPK3 antibodies (Sigma) to determine the amount of total MPK3. For
MAPK activation by MKK5, protoplasts prepared from wild-type plants were
transfected with the indicated plasmids, and protein was isolated 12 h later for
immunoblot analyses. The identities of phosphorylated MAPKs were labelled as
described previously.

Bacterial growth and hypersensitive response assays. Five-week-old
Arabidopsis leaves were infiltrated with the indicated Xcc or Pst bacteria at
10° c.f.u. ml−1, and the bacterial population in the leaf was counted at the indicated
times. Each data point consisted of at least four replicates. For the hypersensitive
response assay, wild-type plants were first infiltrated with Xcc8004 or
Xcc8004ΔavrAC mutant bacteria at 10’ c.f.u. ml−1 1 day before infiltrating 5 × 10”
c.f.u. ml−1 Pst or Pst(avrB). Plants were photographed 6 h after the second
inoculation.

A competitive index assay was used to compare virulence between wild-type
and ΔavrAC mutant strains of Xcc. Briefly, equal amounts (c.f.u.) of wild-type and
ΔavrAC mutant Xcc bacteria were mixed and inoculated into plants, and the leaf
bacterial number for each strain was determined by plating on NYG agar medium
supplemented with appropriate antibiotics. The ratio between the mutant and
wild-type bacteria (competitive index) was calculated at the desired times. For
the virulence assay on cabbage, fully expanded leaves of 4-week-old plants were
clip-inoculated with a mixture of Xcc8004 and Xcc8004ΔavrAC bacteria at
an equal concentration of 10° c.f.u. ml−1 and maintained at high humidity for 48 h
under a transparent dome. Leaf bacterial number and competitive index were
determined 6 days after inoculation. For the virulence assay on Arabidopsis plants,
5-week-old Arabidopsis leaves were infiltrated with a mixture of wild-type
XccB186 and XccB186ΔavrAC mutant bacteria at an equal concentration of
10° c.f.u. ml−1. Leaf bacterial number and competitive index were determined 3
days after inoculation.
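
The competitive index described above reduces to a ratio of recovered colony counts. The following is a minimal sketch, assuming the counts for each strain come from selective plating of the same leaf sample. The function name and the example counts are hypothetical, and the normalization by the inoculum ratio is an added assumption that has no effect when the two strains are mixed in equal amounts, as stated in the text.

```python
def competitive_index(mutant_cfu, wildtype_cfu,
                      inoculum_mutant=1.0, inoculum_wildtype=1.0):
    """Return (mutant / wild type recovered) divided by (mutant / wild type inoculated).

    With an equal-amount inoculum (the defaults), this is simply the ratio of
    mutant to wild-type bacteria recovered from the leaf.
    """
    if wildtype_cfu <= 0 or inoculum_mutant <= 0 or inoculum_wildtype <= 0:
        raise ValueError("counts used as denominators must be positive")
    output_ratio = mutant_cfu / wildtype_cfu
    input_ratio = inoculum_mutant / inoculum_wildtype
    return output_ratio / input_ratio


if __name__ == "__main__":
    # Hypothetical colony counts, for illustration only (not data from the paper).
    print(competitive_index(mutant_cfu=2.0e5, wildtype_cfu=8.0e5))
```

An index below 1 would indicate that the mutant is recovered less efficiently than the wild type from the same tissue.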

CyaA secretion assay. The plasmid pCPP3221 containing an AvrPto-Cya fusion
sequence was introduced into the wild-type Pst (DC3000) and its ΔhrcC mutant
derivative. The derived strains were inoculated at 1 × 10° c.f.u. ml−1 into
Arabidopsis leaves that had been pre-treated with 10’ c.f.u. ml−1 Xcc8004 or
Xcc8004ΔavrAC for 24 h. Amounts of AvrPto-Cya secretion were determined as
described previously. Briefly, leaf samples were ground in liquid nitrogen and
resuspended in 0.1 M HCl. The total protein content of each sample was
determined using the Bradford assay (Bio-Rad). A direct cAMP immunoassay
kit (Sigma) was used to measure cAMP levels following the manufacturer’s
instructions.

Recombinant protein expression and GST pull-down assay. The recombinant
proteins were affinity purified following the manufacturer’s instructions. For
pull-down assays, 10 µg each of His-RIPK, His-BIK1 and GST-AvrAC were incubated
on a rotator with glutathione agarose beads (GE Healthcare) in a buffer containing
25 mM Tris-HCl (pH 7.5), 100 mM NaCl and 1 mM DTT for 3 h, and then washed five
times with the same buffer.
The bound protein was eluted with 15 mM GSH and immunoblotted with a mouse
anti-His monoclonal antibody (Sigma-Aldrich).

Co-immunoprecipitation assay. The protoplasts were transfected with the
indicated constructs, incubated for 12 h and treated with either H2O or 1 µM flg22 for
3-10 min, and total protein was isolated with an extraction buffer (see MAPK
activity assay). For anti-Flag immunoprecipitation, total protein was incubated
with an agarose-conjugated anti-Flag antibody (Sigma-Aldrich) for 4 h and washed six
times with a buffer containing 50 mM HEPES-KOH (pH 7.5), 150 mM KCl, 1 mM
EDTA, 0.2% Triton X-100 and 1 mM DTT, and the bound protein was eluted with
0.5 mg ml−1 3×Flag peptide. For anti-HA immunoprecipitation, total protein was
pre-cleared with protein A agarose (Millipore) for 1 h, followed by precipitation
with 2 µg anti-HA antibody (TianGen) together with protein A agarose for 4 h.
Immunoprecipitates were separated by a 10% NuPAGE gel (Invitrogen) and
detected by anti-HA or anti-Flag immunoblot.

In vitro kinase assay. His-RIPK, His-BIK1 and GST-AvrAC were co-expressed
in E. coli (strain BL21), and the His-tagged protein was purified by Ni-NTA affinity
chromatography. Kinase assays were performed in 25 mM Tris-HCl (pH 7.5),
10 mM MgCl2, 100 nM ATP, 1 mM DTT, γ-32P ATP (5 µCi) and 3 µg
recombinant protein in a total volume of 20 µl. Reactions were allowed to proceed for
30 min at 30 °C and terminated by adding 5× protein loading buffer and boiling for
5 min at 100 °C. The proteins were separated on a 12% SDS-PAGE gel, and incorporated
radiolabel was visualized by autoradiography. As an equal loading control for
proteins used in the kinase reactions, the SDS-PAGE gels were stained by
Coomassie blue.

BIK1 phosphorylation and migration shift assay. Arabidopsis protoplasts were
transfected with HA-tagged BIK1 or BIK1 mutants together with AvrAC-Flag or
truncation mutants of AvrAC and treated with 1 µM flg22, and total protein was
extracted at various time points. Samples were separated by 10% SDS-PAGE gels
followed by anti-HA immunoblot. For phosphatase treatment, total protein from
protoplasts or affinity-purified recombinant protein was treated with λ protein
phosphatase (New England Biolabs) according to the manufacturer’s instructions.

In vitro uridylylation assay. GST-tagged AvrAC derivative proteins and His-tagged
RIPK, BIK1 and PBL1 proteins were affinity purified and directly used for the
uridylylation assay. The GST-PBS1 recombinant protein (a gift from J. Chai) was
cleaved with PreScission Protease (GE) to remove GST before the enzymatic assay.
Approximately 400 ng of GST-AvrAC, GST-ΔCTR, GST-CTR, GST-ΔFic or
GST-AvrAC(H469A) were incubated with 2 µg of His-RIPK, His-BIK1, PBS1,
His-PBL1, or His-tagged kinase domain of FLS2 or BAK1 (ref. 39) in 20 µl
reaction buffer containing 25 mM Tris-HCl (pH 7.5), 25 mM MgCl2, 500 µM
UTP, 1 mM DTT and α-32P UTP (5 µCi) for 30 min at 30 °C. Reactions were stopped
by addition of loading buffer. The products were separated on 12% NuPAGE gels
(Invitrogen) and visualized by autoradiography.

Mass measurement of the recombinant AvrAC protein. An affinity-purified
GST-AvrAC protein solution was loaded into an analytical capillary column
(75 µm × 2 cm) packed with Poros 20 R2 packing material (Applied Biosystems).
An Agilent 1100 binary pump was used to generate the HPLC gradient as follows:
0-100% B in 60 min (A = 0.1 M acetic acid in water; B = 0.1 M acetic acid/70%
acetonitrile). The eluted proteins were sprayed into a QSTAR XL mass
spectrometer (AB Sciex). The spray voltage was set at 2,100 V and the data were acquired in
MS mode. The protein charge envelope from the raw spectrum was deconvoluted
into non-charged form by the BioAnalyst software provided by the manufacturer.

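The idea behind converting a charge envelope into a neutral mass can be illustrated with two adjacent peaks of an electrospray series. This is a minimal sketch of the textbook two-peak relation, not the algorithm used by the BioAnalyst software, which is not described in the text. The function names and the 50,000 Da test mass are hypothetical.

```python
PROTON_MASS = 1.007276  # Da, mass added per positive charge


def charge_from_adjacent_peaks(mz_lower_charge, mz_higher_charge):
    """Charge z of the peak at mz_lower_charge.

    mz_higher_charge is the neighbouring peak carrying charge z + 1, so it
    sits at a smaller m/z. From m1 = M/z + p and m2 = M/(z + 1) + p it
    follows that z = (m2 - p) / (m1 - m2).
    """
    z = (mz_higher_charge - PROTON_MASS) / (mz_lower_charge - mz_higher_charge)
    return round(z)


def neutral_mass(mz, z):
    """Neutral (non-charged) mass of a species observed at m/z with charge z."""
    return z * (mz - PROTON_MASS)


if __name__ == "__main__":
    # Synthetic peaks for a hypothetical 50,000 Da protein (illustration only).
    true_mass = 50000.0
    mz_40 = true_mass / 40 + PROTON_MASS
    mz_41 = true_mass / 41 + PROTON_MASS
    z = charge_from_adjacent_peaks(mz_40, mz_41)
    print(z, neutral_mass(mz_40, z))
```

In practice the estimate would be averaged over every adjacent pair in the envelope rather than taken from a single pair.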
Mapping UMP modification sites. Affinity-purified GST-AvrAC, His-BIK1,
His-RIPK, BIK1-Flag and RIPK-Flag proteins were separated by SDS-PAGE.
Protein bands on the SDS-PAGE gel were de-stained, and then reduced in 10 mM
DTT at 56 °C for 30 min followed by alkylation in 55 mM iodoacetamide in the dark
for 1 h. After that, the protein bands were in-gel digested with sequencing grade
trypsin (10 ng µl−1 trypsin, 50 mM ammonium bicarbonate, pH 8.0) overnight at
37 °C. Peptides were extracted with 5% formic acid/50% acetonitrile and 0.1%
formic acid/75% acetonitrile sequentially and then concentrated to ~20 µl. The
extracted peptides were separated by an analytical capillary column (50 µm × 10
cm) packed with 5 µm spherical C18 reversed phase material (YMC). An Agilent
1100 binary pump was used to generate the HPLC gradient as follows: 0-5% B in 5
min, 5-40% B in 25 min, 40-100% B in 15 min (A = 0.1 M acetic acid in water;
B = 0.1 M acetic acid/70% acetonitrile). The eluted peptides were sprayed into a
QSTAR XL mass spectrometer equipped with a nano-ESI ion source. The mass
spectrometer was operated in information-dependent mode with one MS scan
followed by three MS/MS scans for each cycle. Database searches were performed
on an in-house Mascot server (Matrix Science) and the following variable
modifications were included: oxidation on methionine, carbamidomethylation
on cysteine, UMP modification on serine or threonine residues, and AMP
modification on serine, threonine or tyrosine residues.
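
A database search with UMP or AMP as a variable modification needs the mass added to the modified residue. This sketch computes that shift from elemental composition, assuming the nucleotide monophosphate is attached to a hydroxyl group with net loss of one water. The dictionary and function names are hypothetical, and the result is not taken from the Mascot configuration, which the text does not give.

```python
# Monoisotopic masses of the elements involved (Da).
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503,
    "N": 14.00307401,
    "O": 15.99491462,
    "P": 30.97376200,
}

# Elemental compositions of the free nucleotide monophosphates and of water.
UMP = {"C": 9, "H": 13, "N": 2, "O": 9, "P": 1}
AMP = {"C": 10, "H": 14, "N": 5, "O": 7, "P": 1}
WATER = {"H": 2, "O": 1}


def formula_mass(formula):
    """Monoisotopic mass of an elemental composition given as {element: count}."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


def modification_shift(nucleotide):
    """Mass added to a Ser/Thr/Tyr residue: nucleotide monophosphate minus water."""
    return formula_mass(nucleotide) - formula_mass(WATER)


if __name__ == "__main__":
    print(f"UMP modification: +{modification_shift(UMP):.4f} Da")
    print(f"AMP modification: +{modification_shift(AMP):.4f} Da")
```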
When viewed as an elementary particle, the electron has spin and
charge. When binding to the atomic nucleus, it also acquires an 
angular momentum quantum number corresponding to the 
quantized atomic orbital it occupies. Even if electrons in solids 
form bands and delocalize from the nuclei, in Mott insulators they 
retain their three fundamental quantum numbers: spin, charge 
and orbital’. The hallmark of one-dimensional physics is a break- 
ing up of the elementary electron into its separate degrees of 
freedom’. The separation of the electron into independent quasi- 
particles that carry either spin (spinons) or charge (holons) was 
first observed fifteen years ago’. Here we report observation of 
the separation of the orbital degree of freedom (orbiton) using 
resonant inelastic X-ray scattering on the one-dimensional Mott 
insulator Sr2CuO3. We resolve an orbiton separating itself from
spinons and propagating through the lattice as a distinct quasi- 
particle with a substantial dispersion in energy over momentum, of 
about 0.2 electronvolts, over nearly one Brillouin zone. 

It was pointed out in the 1970s that in a solid not only the charge and 
spin of electrons can become ordered—leading to magnetism—but 
also the electrons' orbital degree of freedom. This observation sparked
a field that has gone on to produce a number of important results. 
Although a physical electron combines spin, charge and orbital, 
theoretically an electron can be considered a bound state of the three 
independent, fundamental quasi-particles: a spinon, carrying the 
electron’s spin; a holon (or chargon), carrying its charge; and an 
orbiton, carrying its orbital degree of freedom. 

A remarkable and fundamental property of one-dimensional (1D) 
systems is that electronic excitations break up into deconfined spinons 
and holons. This was predicted decades ago (ref. 2 and references 
therein) and confirmed in the mid 1990s by angle-resolved photo- 
emission spectroscopy experiments*°. The spin-charge separation is 
an example of particle fractionalization, a phenomenon in which the 
quantum numbers of quasi-particles are not multiples of those of the 
elementary particle, but fractions. This effect is one of the most 
unusual manifestations of collective quantum physics of interacting 
particles and is a profound concept that has found its way into a 
number of theories, for example that describing high-temperature 
superconductivity in copper oxides®”. 

To search for the further fractionalization of the electron, we 
consider the excitation of a copper orbital degree of freedom in the 
antiferromagnetic spin-chain compound Sr2CuO3. The spin-orbital
separation process that we are looking for is analogous to the spin- 
charge separation mechanism (Fig. 1b). The latter occurs, for instance, 
when an electron is annihilated, removing a single spin and leaving 
behind a hole in the antiferromagnetic chain. This hole can start to 
propagate freely only after exciting one spinon (a domain wall in the 
antiferromagnetic chain). Subsequently, the spinon can delocalize and
separate itself completely from the holon. When instead of creating a
hole, as typically is done in a photoemission experiment, an electron is 
excited from one copper 3d orbital to another, the phenomenon of 
spin-orbital separation can in principle occur (Fig. 1a). The orbiton 
created in this manner may also deconfine after exciting a spinon, thus 
splitting the electron into its orbital and spin degrees of freedom’. 

Here we use high-resolution resonant inelastic X-ray scattering
(RIXS) to search experimentally for spin-orbital separation in the
quasi-1D copper oxide Sr2CuO3 (for material details, see Supplementary
Information, section 1). We observe deconfinement of the spinon
and orbiton during orbital excitation from the ground-state copper 3d
x² − y² orbital to an excited copper 3d xy or xz orbital (Fig. 1c-e). For
simplicity, we will henceforth use the so-called ‘hole’ language:
although nominally there are nine electrons in the 3d orbitals of
the Cu²⁺ ion in Sr2CuO3, by using the ‘electron-hole’ transformation
we can map the problem onto an effective system with one particle
occupying a single 3d orbital (Supplementary Information, section 2).

We measure an orbiton dispersion that is almost as large as the
dispersion of the two-spinon continuum at low energies. As for
spin-charge separation, the orbiton dispersion has periodicity π
(see Fig. 1c and the discussion below), which indicates the presence
of an orbiton liberated from the spinon.

We measured the orbital excitations of Sr2CuO3 using RIXS at the L3
edge of the copper ion. RIXS is a second-order scattering technique
and can excite transitions between the copper 3d states of different
symmetry (orbital excitations), owing to the involvement of two
subsequent electric dipole transitions (Supplementary Information,
section 3). With the unique capability of RIXS also to probe spin
excitations and to vary the photon momentum transfer, the
dispersion of orbital and spin excitations can be mapped out across the
first Brillouin zone. The experiments were carried out at the
ADRESS beamline of the Swiss Light Source at the Paul Scherrer
Institut.

For fixed momentum transfer, q, along the chains, peaks in the RIXS 
spectrum at constant energy transfer reveal the presence of charge- 
neutral elementary excitations and are visible in the RIXS intensity 
map of Sr2CuO3 across the copper L3 edge in Fig. 2a. The spectrum for
which the incident energy was precisely tuned to the resonance 
maximum of the absorption spectrum is shown in Fig. 2b. In both 
plots, the excitations of the spin, orbital and charge degrees of freedom 
are indicated. The momentum dependence and, in particular, the 
dispersion of the spin and orbital excitations (Fig. 1c and Supplemen- 
tary Fig. 2a) are indicative of their collective nature. 

For energy transfers of up to ~0.8 eV purely magnetic excitations 
are present, but the spectrum between ~1.5 and ~3.5 eV corresponds 
to excitations from the copper 3d x² − y² ground state to orbitals of xy,
xz/yz and 3z² − r² symmetry (Fig. 1d, e). These peaks correspond to
Figure 1 | Spin-orbital separation process in an antiferromagnetic spin
chain, emerging after exciting an orbital. a, b, Sketches of spin-orbital
separation (a) and spin-charge separation (b), generated in processes of RIXS
and angle-resolved photoemission spectroscopy, respectively. The spin is
represented by the arrow and the charge is represented by the green dot. In
a, the lower and upper lines represent the ground-state and excited-state
orbitals, respectively. c, RIXS intensity map of the dispersing spin and orbital
excitations in Sr2CuO3 as functions of photon momentum transfer along the
chains and photon energy transfer (for details, see main text and
Supplementary Information, section 3). a, [100] lattice constant; a.u., arbitrary
units. Data obtained at Ψ = 130°. d, Geometry of the CuO3 chain, with the
ground-state copper 3d x² − y² orbitals in the middle of each plaquette and
oxygen sites at the plaquette corners. e, Orbital symmetries of x² − y² and
excited 3d orbitals. In panels c-e, ‘hole’ language is used (Supplementary
Information, section 2).

orbital excitations (also called d-d excitations), and not, for example,
to charge transfer excitations, the intensity of which is non-zero for
energies up to ~6 eV but here for the case of L-edge RIXS is at least an
order of magnitude lower. The orbital assignment of these excitations
was unambiguously verified by comparing their energy at q = 0 with
ab initio quantum chemistry cluster calculations (Supplementary
Information, section 5, for detailed results).

Zooming into the ‘magnetic’ part of Fig. 1c, between 0 and 0.8 eV in
energy transfer, reveals strongly dispersing spin excitations. At the
lower boundary the dispersion has period π, and at the continuum
(upper boundary) it has period 2π (Fig. 3a). These RIXS data agree very
well with recent inelastic neutron scattering studies on Sr2CuO3
(ref. 20). The simultaneous presence in the spectrum of a lower edge
with period π and an upper one with period 2π indicates directly that
in the spin chain the magnetic excitations with spin 1 break up and
fractionalize into two-spinon (and higher-order) excitations that make
up a continuum, with each spinon having spin 1/2. These spectra
confirm that RIXS for magnetic excitations probes the well-known
spin dynamical structure factor as theoretically predicted, in
agreement with recent studies on TiOCl (ref. 21). The excellent statistics of
Figure 2 | Energy dependence of elementary excitations in Sr2CuO3
observed with RIXS at the copper L3-edge resonance. a, RIXS map of
Sr2CuO3 (log scale) as a function of photon excitation energy (left axis) and
energy transfer (bottom axis). The superimposed black curve shows the total
fluorescence yield X-ray absorption spectrum. The dashed line marks the
maximum energy of the copper L3-edge resonance. b, RIXS line spectrum
measured at the resonance maximum (along the dashed line in a). All data were
obtained at a scattering angle of Ψ = 90° and for momentum transfer along the
chain of q = 0.189 × 2π/a (Methods).
Figure 3 | Dispersion of magnetic excitations: experimental data and
simulation. a, RIXS energy transfer spectra of the magnetic two-spinon
continuum (see the spinon region in Fig. 1c). Data obtained at Ψ = 130°. b, Fit
(red line) of the experimental data (dots) with the two- and four-spinon
dynamical structure factor (green line), convolved with a Gaussian
distribution (light-grey shaded region) to account for the total instrumental
resolution of 140 meV. The low-energy peak at around 50 meV (black line) is
elastic and phonon scattering. The values of momentum transfer for the
selected spectra (shown in units of 2π/a) are indicated by arrows in a.


the data further allow for a direct comparison of the RIXS line shapes 
with the exact two- and four-spinon dynamical structure factor of the 
spin-1/2 Heisenberg chain. In Fig. 3b, we show the fits for three 
selected momentum transfer values using the exact two- and four- 
spinon dynamical structure factor, S(q, ω) (where q and ω stand for
the momentum and energy of the created excitation, respectively) in
the representation of ref. 22. The obtained exchange coupling,
J = 249 meV, is in very good agreement with the value obtained from
inelastic neutron scattering data.
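
The period-π lower edge and period-2π upper edge of the two-spinon continuum can be made concrete with the standard boundaries for the spin-1/2 Heisenberg chain, ω_L(q) = (πJ/2)|sin q| and ω_U(q) = πJ|sin(q/2)|. These expressions are not written out in the text and are used here as an assumption; the function name is hypothetical, and only J = 249 meV is taken from the fit quoted above.

```python
import math

J_EV = 0.249  # exchange coupling in eV, from the fit quoted in the text


def two_spinon_boundaries(q, j=J_EV):
    """Return (lower, upper) edges of the two-spinon continuum in eV.

    q is the momentum along the chain in units of 1/a, so the Brillouin
    zone spans 2*pi.
    """
    lower = 0.5 * math.pi * j * abs(math.sin(q))   # repeats with period pi
    upper = math.pi * j * abs(math.sin(0.5 * q))   # repeats with period 2*pi
    return lower, upper


if __name__ == "__main__":
    for fraction in (0.1, 0.25, 0.5):
        lower, upper = two_spinon_boundaries(2.0 * math.pi * fraction)
        print(f"q = {fraction:.2f} x 2pi/a: lower = {lower:.3f} eV, upper = {upper:.3f} eV")
```

With this value of J the upper edge peaks at πJ ≈ 0.78 eV at the zone boundary, in line with the statement that purely magnetic excitations extend up to about 0.8 eV.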

Having unambiguously identified the fractionalized spinon
excitations in the low-energy sector, we now concentrate on the spectrum of
orbital excitations in Fig. 4. We find that these are strongly
momentum dependent and have a novel, distinct dispersion. This
proves that the orbital excitations observed here are of collective nature.
The xz excitation has the largest dispersion, of ~0.2 eV, and has a
spectrum containing two peculiar components: a lower branch
dispersing with periodicity π and, above that, an incoherent spectrum with a
double-oval shape. This spectrum is strikingly similar to seemingly 
unrelated angle-resolved photoemission spectra of 1D copper oxides, 
which evidence spin-charge separation (see, for example, fig. 3 of ref. 3). 
This is an indication that the observed orbital dispersion is related to an 
analogous separation of degrees of freedom. 

To test this conjecture, we derived a microscopic model that 
describes the spin-orbital interactions in Sr2CuO3 (Methods). The
low-energy Kugel’-Khomskii Hamiltonian for this was obtained
Figure 4 | Dispersion of orbital excitations: comparison between
experiment and ab initio calculations. a, c, Comparison between the orbital
region in Fig. 1c (a, left) and theory, using the J_O-J model (c, right). b, Lower
and upper edges of the orbiton dispersion (solid black and grey lines) and the
from the charge transfer model of Sr2CuO3 in ref. 24. The crucial part
of the Hamiltonian, responsible for the xz orbital propagation, is

$$H = -J_O \sum_{j,\sigma} \left( c^{\dagger}_{j\sigma} c_{j+1,\sigma} + \mathrm{h.c.} \right) + J \sum_{j} \mathbf{S}_j \cdot \mathbf{S}_{j+1} + E_O \sum_{j} \left( 1 - n_j \right) \qquad (1)$$

where J and J_O are respectively the spin and orbital exchange constants,
both of which are fixed by the charge transfer model; E_O is the xz orbital
on-site energy; c_jσ and c†_jσ are operators that respectively annihilate and
create particles in the xz orbital with spin σ; S_j is the spin of the particle in
the x² − y² orbital; 1 − n_j counts the number of particles in the xz orbital; and
‘h.c.’ denotes Hermitian conjugate. The first term in the Hamiltonian
describes the propagation of orbital excitations through the lattice. The
second term represents the usual Heisenberg interaction between spins,
which vanishes on bonds where an orbital excitation is present
(Methods).

From a purely mathematical standpoint, equation (1) is identical to
the 1D t-J model, which describes the hopping of holes doped into an
antiferromagnetic spin chain in which the spins interact via
superexchange J, with the orbital superexchange, J_O, taking the place of t,
the hole-hopping amplitude in the t-J model. This implies that the
propagation of a single orbital excitation in the J_O-J model above is
equivalent to the propagation of a single hole in the 1D t-J
Hamiltonian. Because in the 1D t-J model a hole breaks up into a free
spinon and holon, in the J_O-J model the orbital excitation will separate
into a free spinon and orbiton.

Before calculating RIXS spectral functions and quantifying the
separation of spin and orbital degrees of freedom in Sr2CuO3, it is
instructive to consider the overall features of the momentum-dependent RIXS
spectra within a slave-boson picture in terms of an orbital-spin
separation ansatz. This ansatz is analogous to the mean-field slave-boson
picture for charge-spin separation, which stipulates that the hole
spectral function is given by the convolution of the spectral function of
a free holon and a free spinon. In the orbital-spin separation ansatz, free
orbitons and spinons, created by the operators o_j and s†_jσ, form the
particle creation operator c†_jσ = s†_jσ o_j. On applying this formalism to the
Hamiltonian in equation (1), the energies of non-interacting orbitons
and spinons with momentum k become ε_O(k) = E_O − 2J_O cos(k) and
ε_s(k) = −J cos(k), respectively. In this ansatz, the spectral function for
the orbital excitation produced in RIXS, cj, = x ce 4a; (where q, 
the x component of q, is here the momentum transfer along the CuO3
chain direction), is determined by a convolution of these two
dispersions. The resulting excitation continuum is shown in Fig. 4b and has
three defining features: a prominent lower edge with energy
E(q) = E_O − 2J_O |sin(q)|, that is entirely determined by the dispersion
of the orbiton; a distinctive upper orbiton edge at
E(q) = E_O + 2J_O |sin(q)|; and a broader upper limit of the
orbiton-spinon continuum at E(q) = E_O + √(J² + 4J_O² − 4JJ_O cos(q)). These
dispersion curves are directly compared to our RIXS experiments in
upper edge of the spinon-orbiton continuum (dashed line), as calculated using
the orbital-spin separation ansatz. See main text and Supplementary 
Information for further details. 
Fig. 4b. The measured spectral weight that accumulates at the first two
lower-energy features does so with periodicity π, which is a direct
consequence of spin-orbital separation.
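
The three edges of the orbiton-spinon continuum can be evaluated directly from the ansatz. This is a minimal sketch with the upper continuum limit written as E_O + √(J² + 4J_O² − 4JJ_O cos q), which is what results from maximizing ε_O(q + k) − ε_s(k) over an unrestricted spinon momentum k; that way of combining the two dispersions is an assumption of this sketch. The function name is hypothetical, and the values of E_O and J_O in the usage lines are illustrative placeholders, not fitted parameters from the paper.

```python
import math


def orbiton_spinon_edges(q, e_o, j_o, j):
    """Return (lower edge, upper orbiton edge, upper continuum limit) in eV.

    q is the momentum transfer along the chain in units of 1/a; e_o is the
    orbital on-site energy, j_o the orbital exchange and j the spin exchange.
    """
    s = abs(math.sin(q))
    lower = e_o - 2.0 * j_o * s            # follows the orbiton, period pi
    upper_orbiton = e_o + 2.0 * j_o * s    # mirror image of the lower edge
    upper_continuum = e_o + math.sqrt(
        j * j + 4.0 * j_o * j_o - 4.0 * j * j_o * math.cos(q)
    )
    return lower, upper_orbiton, upper_continuum


if __name__ == "__main__":
    # E_O and J_O below are placeholders for illustration; J is the fitted
    # spin exchange quoted in the text.
    for fraction in (0.0, 0.25, 0.5):
        edges = orbiton_spinon_edges(2.0 * math.pi * fraction, e_o=2.0, j_o=0.1, j=0.249)
        print(fraction, [round(e, 3) for e in edges])
```

In this picture the lower edge has a bandwidth of 2J_O, so the measured dispersion of the lower branch fixes the scale of the orbital exchange.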

To quantify the spectral weights within the orbiton-spinon
continuum, we have calculated for the J_O-J Hamiltonian the orbital
excitation Green’s function by exactly diagonalizing the Hamiltonian on 28
lattice sites. As in the spin-charge separation studies, finite-size effects
are negligible (they are estimated at ~0.01 eV). From this, we
calculate the RIXS spectrum following ref. 13, that is, by expressing the
RIXS amplitude as a product of the single-ion local RIXS effective
operator and the orbital excitation Green’s function (Methods).

We find excellent agreement between theory and experiment
(Fig. 4): both the sine-like xz orbiton dispersion and the xz
spinon-orbiton continuum (the ‘double-oval’ incoherent spectrum), which are
the hallmarks of spin-orbital separation, are present in the theory. The
calculations also show that, in contrast to the xz orbital, the xy orbital
has a small dispersion and that the excitations of both the 3z² − r²
orbital and the yz orbital are dispersionless, as is observed experimentally.
This is an independent merit of the model because no fitting of
dispersions to experimental data is involved.

The large orbiton dispersion observed in this study is the key feature
that distinguishes Sr2CuO3 from other systems with orbital
excitations, and relies on the 1D character of Sr2CuO3. In a system of
higher dimensionality, orbitons interact with magnetic excitations,
which tend to slow them down and thus reduce their dispersion. In
one dimension, orbitons can avoid these renormalization effects by
means of spin-orbital separation.


METHODS SUMMARY 


We applied the technique of high-resolution RIXS with the incident photon energy
tuned to the L3 edge (2p3/2 → 3d resonance) of the copper ion (around 931 eV) and
a total experimental resolution of 140 meV. The experiments were performed at
the ADRESS beamline of the Swiss Light Source at the Paul Scherrer Institut.
The RIXS spectrometer was located at a fixed scattering angle of either Ψ = 90° or
130° and was collecting signal within the solid angle of (19.3 × 3.3) mrad²
(horizontal × vertical detection direction; for a sketch of the experimental geometry, see
Supplementary Fig. 1).

Single-crystal samples of Sr2CuO3 were grown by the floating-zone method and
freshly cleaved before the RIXS experiment. The surface normal to the sample,
[010], and the propagation direction of the chains, [100], were oriented parallel to
the scattering plane. The sample was cooled with a helium-flow cryostat to 14 K
during the measurements. Incident photons were linearly polarized in the
scattering plane (π-orientation). The momentum transfer along the chains, q, was varied
by changing the incidence angle in steps of 5 ± 1°.
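
How the incidence angle sets the momentum transfer along the chains can be sketched from the scattering geometry. The sketch assumes that the incidence angle is measured from the sample surface, that the outgoing beam leaves at the scattering angle minus the incidence angle, and that the small photon energy loss is neglected. The function names are hypothetical, the sign convention is a choice made here, and the lattice constant is left as an input because the text does not state it.

```python
import math

HC_EV_ANGSTROM = 12398.42  # h*c in eV * angstrom


def photon_wavevector(energy_ev):
    """Photon wavevector k = 2*pi*E/(h*c) in inverse angstroms."""
    return 2.0 * math.pi * energy_ev / HC_EV_ANGSTROM


def q_parallel(energy_ev, scattering_angle_deg, incidence_angle_deg):
    """In-plane momentum transfer in inverse angstroms.

    The incidence angle is measured from the sample surface; the outgoing
    beam is assumed to make the angle (scattering - incidence) with it.
    """
    k = photon_wavevector(energy_ev)
    theta_in = math.radians(incidence_angle_deg)
    theta_out = math.radians(scattering_angle_deg - incidence_angle_deg)
    return k * (math.cos(theta_in) - math.cos(theta_out))


def in_units_of_2pi_over_a(q, lattice_constant_angstrom):
    """Convert a momentum in inverse angstroms to units of 2*pi/a."""
    return q * lattice_constant_angstrom / (2.0 * math.pi)


if __name__ == "__main__":
    # 931 eV and the 130 degree scattering angle are from the text; the
    # incidence angles are stepped by 5 degrees as described there.
    for incidence in range(10, 125, 5):
        print(incidence, round(q_parallel(931.0, 130.0, incidence), 4))
```

At specular incidence, where the incidence angle is half the scattering angle, the in-plane momentum transfer vanishes; stepping the incidence angle away from that point scans q across the zone.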

Theoretical modelling was done by deriving a Kugel’-Khomskii Hamiltonian 
from the charge transfer model and then writing it in a representation similar to 
the one used for the t-J model. Using this model, we calculated the spectral 
functions for the case of a single orbital excitation in the antiferromagnetic back- 
ground by solving the Hamiltonian numerically on 28 sites. Such calculations were 
repeated separately for the two distinct dispersive orbital excitations, whereas for 
the non-dispersive excitations we confirmed that the parameters of the model did 
not allow for a dispersion larger than the experimental resolution. Finally, the 
RIXS cross-section was calculated by multiplying the calculated spectral functions 
for the orbital excitations by the local RIXS form factors. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Experiment. We applied the technique of high-resolution RIXS with the incident 
photon energy tuned to the L3 edge (2p3/2 → 3d resonance) of the copper ion 
(around 931 eV) and a total experimental resolution of 140 meV. The experiments 
were performed at the ADRESS beamline of the Swiss Light Source at the Paul 
Scherrer Institut. Single-crystal samples of Sr2CuO3 were grown by the floating-zone 
method and freshly cleaved before the experiment. The samples were mounted 
such that the surface normal, [010], and the chain propagation direction, [100], 
were oriented parallel to the scattering plane (Supplementary Information, section 
4, and Supplementary Fig. 1). The polarization vector of the incident light, ε_in, was 
parallel to the scattering plane (π-orientation). This yielded the maximum cross-section 
for the copper 2p⁶3d⁹ → 2p⁵3d¹⁰ transition. The RIXS spectrometer was 
located at a fixed scattering angle of either Ψ = 90° or 130°, collecting signal within 
a solid angle of (19.3 × 3.3) mrad² (horizontal × vertical detection direction). The 
momentum transfer along the chains, q, was varied by changing the incidence angle 
in steps of 5 ± 1°. This results in the RIXS signal being polarization dependent and 
the relative intensity of the orbital excitations varying as a function of q. (For details 
on the polarization dependence of the orbital excitations, see the discussion of the 
RIXS cross-section below.) Supplementary Fig. 2a shows the RIXS line spectra, 
which correspond to the intensity map data in Fig. 1c, and Supplementary Fig. 
2b shows the variation in the polarization of the incident photons with q. We 
estimate the systematic errors in the momentum transfer to be about 0.006 × 2π/a 
and 0.013 × 2π/a at the edge and the centre of the Brillouin zone, respectively. 
The error in determining the energy zero of the energy transfer scale is about 
30 meV. The sample was cooled with a helium-flow cryostat to 14 K during the 
measurements. 
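
As a rough illustration of how stepping the incidence angle tunes q, the sketch below projects the photon momentum transfer onto the sample surface (which contains the chain direction). It is not the authors' analysis code. It assumes an elastic approximation (equal incoming and outgoing photon momenta) and a reflection geometry in which the incidence and detection angles are both measured from the sample surface and add up to the scattering angle. That convention, the function name and the lattice constant in the example call are assumptions made for illustration.

```python
import math

HBAR_C_EV_ANGSTROM = 1973.27  # hbar*c in eV*Angstrom (rounded physical constant)


def chain_momentum_transfer(photon_energy_ev, scattering_angle_deg,
                            incidence_angle_deg, lattice_a_angstrom):
    """Momentum transfer along the chain, in units of 2*pi/a.

    Assumed convention (not taken from the paper): the incidence angle
    theta and the detection angle beta are measured from the sample
    surface, and theta + beta equals the scattering angle.
    """
    k = photon_energy_ev / HBAR_C_EV_ANGSTROM            # photon wavevector, 1/Angstrom
    theta = math.radians(incidence_angle_deg)
    beta = math.radians(scattering_angle_deg) - theta
    q_parallel = k * (math.cos(theta) - math.cos(beta))  # projection onto the surface
    return q_parallel * lattice_a_angstrom / (2.0 * math.pi)


if __name__ == "__main__":
    a = 3.91  # Angstrom; hypothetical lattice constant, replace with the real value
    for theta_deg in range(10, 70, 5):                   # 5 degree steps, as in the text
        q = chain_momentum_transfer(931.0, 130.0, theta_deg, a)
        print(f"theta = {theta_deg:2d} deg -> q = {q:+.3f} x 2pi/a")
```

In this convention q vanishes at specular reflection (incidence angle equal to half the scattering angle) and grows as the incidence becomes more grazing.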
Theory of spin excitations. Owing to the particular choice of the experimental 
geometry, the RIXS cross-section for spin excitations (that is, the magnetic part of 
the spectrum extending up to ~0.8 eV) is directly proportional to the two-point 
dynamical spin-spin correlation function (local RIXS form factors are independ- 
ent of transferred momentum here); see refs 9, 11-14. Therefore, the magnetic part 
of the RIXS spectrum was fitted to the two-point dynamical spin-spin correlation 
function (see below), convolved with a Gaussian to account for experimental 
resolution and adding a low-energy Gaussian covering the elastic and phonon 
peak around 50 meV. This captured most of the measured intensity with nearly 
constant J, amplitude and resolution. Fixing these three parameters to their mean 
values gives reasonable fits for all q. The time-normalized data sets were corrected 
for the effective scattering volume, $V_{\mathrm{eff}}$: 

$$\frac{1}{V_{\mathrm{eff}}} = 1 + \frac{\mu_f}{\mu_i}(\omega)\,\frac{\sin(\theta)}{\sin(\beta)}$$

Here (sin(θ)/sin(β))(q) is a geometric term, θ and β being the angles between the 
sample surface and the direction of photon incidence and, respectively, the direction 
of photon detection; and $(\mu_f/\mu_i)(\omega)$ is an outgoing-energy-dependent absorption 
factor derived from the total fluorescence yield X-ray absorption spectra. 
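
The correction can be sketched in a few lines. The example assumes that it has the usual self-absorption form, 1/V_eff = 1 + (absorption ratio) × sin θ / sin β, and that "correcting" means scaling each time-normalized intensity by 1/V_eff. The function names and the input layout are hypothetical.

```python
import math


def inverse_effective_volume(mu_ratio, theta_deg, beta_deg):
    """Return 1/V_eff = 1 + mu_ratio * sin(theta) / sin(beta) (assumed form).

    mu_ratio  : absorption ratio at the outgoing photon energy
    theta_deg : angle between sample surface and incident beam
    beta_deg  : angle between sample surface and detected beam
    """
    geometric = math.sin(math.radians(theta_deg)) / math.sin(math.radians(beta_deg))
    return 1.0 + mu_ratio * geometric


def correct_spectrum(intensities, mu_ratios, theta_deg, beta_deg):
    """Scale each intensity by 1/V_eff evaluated at its own outgoing energy."""
    return [
        intensity * inverse_effective_volume(mu, theta_deg, beta_deg)
        for intensity, mu in zip(intensities, mu_ratios)
    ]
```

Because the geometric term changes with the incidence angle, the correction differs from one q point to the next even when the absorption ratio is the same.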

The exact solvability of the Heisenberg spin-1/2 chain is exploited to compute 
its two-point dynamical spin-spin correlation function. At zero magnetization, 
this is predominantly carried by two- and four-spinon intermediate states whose 
exact contributions can be written in the thermodynamic limit in terms of fundamental 
integrals by using the vertex operator approach. The numerical evaluation 
of these integrals provides the dynamical structure factor throughout the 
Brillouin zone, yielding sum-rule saturations of the order of 98%. 
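
The full two- and four-spinon structure factor requires the integrals mentioned above and is not reproduced here. As a much simpler orientation aid, the sketch below evaluates only the textbook boundaries of the two-spinon continuum of the spin-1/2 Heisenberg chain, the lower edge (π/2)J|sin(qa)| and the upper edge πJ|sin(qa/2)|. The function name and the value of J in the example are illustrative assumptions, not fitted values from the paper.

```python
import math


def two_spinon_bounds(q_rlu, J):
    """Lower and upper edges of the two-spinon continuum.

    q_rlu : momentum along the chain in units of 2*pi/a
    J     : Heisenberg exchange constant (result has the same energy unit)
    """
    qa = 2.0 * math.pi * q_rlu
    lower = 0.5 * math.pi * J * abs(math.sin(qa))
    upper = math.pi * J * abs(math.sin(0.5 * qa))
    return lower, upper


if __name__ == "__main__":
    J = 0.25  # eV; illustrative value only
    for q in (0.1, 0.2, 0.3, 0.4, 0.5):
        lo, up = two_spinon_bounds(q, J)
        print(f"q = {q:.1f} x 2pi/a: continuum from {lo:.3f} to {up:.3f} eV")
```

With this illustrative J the upper edge at the zone boundary is π × 0.25 eV, roughly the ~0.8 eV extent of the magnetic spectrum quoted above.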

Theory of orbital excitations. We first calculate the energies of the 3d orbitals in 
Sr2CuO3 using the ab initio quantum chemistry method (Supplementary 
Information, section 5). Although this method is not tailored to calculate the 
dispersion of orbital excitations (for which model calculations are needed) owing 
to the finite size of the cluster, it identifies from first principles the energies of 
orbital excitations in Sr2CuO3 for q = 0 transferred momentum. These energies 
have been found to be in close agreement with both optical absorption and RIXS 
experiments. Owing to the large separation of the d-d excitation energies 
(Supplementary Table 1), we can separately model the momentum-dependent 
RIXS cross-section for each orbital excitation for the case of non-negligible 
inter-orbital hopping. For this (see the next section), the Kugel’-Khomskii 
Hamiltonian for the case of a single orbital excitation (for example the 
x² − y² → xz transition) from a ferro-orbital ground state (that is, one in which 
all x² − y² orbitals are occupied) is derived on the basis of the established charge 
transfer model for Sr2CuO3. Similar calculations are performed for the other 
orbital excitation symmetries (see the next section) and the RIXS cross-section 
based on the solution to these model calculations is calculated (see the discussion 
of RIXS cross-section below). 

Model Hamiltonian. In Mott insulators, the Kugel’-Khomskii model is the effective 
low-energy superexchange model for coupled spin and orbital excitations; 
compare with ref. 1. We first consider only an xz orbital excitation, which may hop 
between the copper sites in a ‘three-step’ perturbative superexchange process with 
an energy scale ~$t_1 t_2/2U$. First the particle in the excited orbital moves from site j 
to the neighbouring site j + 1 by hopping with amplitude ~$t_2$; next an intermediate 
state, in which two particles are on site j + 1 and which has an energetic cost 
equivalent to the on-site Coulomb repulsion ~U, is formed; and finally the particle 
in the ground-state orbital moves from site j + 1 to site j by hopping with amplitude 
$t_1$. Thus, we obtain 


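$$H_O = -J_O \sum_{j,\sigma} \left( c^{\dagger}_{j\sigma} c_{j+1,\sigma} + \mathrm{h.c.} \right) + E_O \sum_{j} \left( 1 - n_j \right) \qquad (2)$$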


where $J_O = (3R_1 + R_2)\,2t_1t_2/U$ and $E_O$ is the energy cost of the orbital excitation. In 
$J_O$, $R_1 = 1/(1 - 3\eta)$ and $R_2 = 1/(1 - \eta)$ originate in the multiplet structure of the 
intermediate states of the superexchange processes and depend on the ratio, 
$\eta = J_H/U$, of the Hund’s exchange to the Coulomb repulsion. Although the above 
equation is given for the Mott-Hubbard limit of the superexchange model, in our 
calculation we modified the parameters to account for superexchange processes on 
oxygen atoms in Sr2CuO3. For a standard set of charge transfer parameters, this 
gives $J_O$ ~ 0.075 eV without any fitting to the experimental data. The values of $E_O$ 
are determined from the RIXS experiment and are within 10% of the on-site orbital 
energies obtained using ab initio quantum chemistry cluster calculations for four 
CuO4 plaquettes in Sr2CuO3 (Supplementary Information, section 5). 

In equation (2), $c_{j\sigma}$ is a fermion operator, acting in the restricted Hilbert space 
with no double occupancies, that creates an orbital excitation (a hole in the spin 
background in the language of the t-J model) at site j with spin σ, and 
$1 - n_j = 1 - \sum_{\sigma} c^{\dagger}_{j\sigma} c_{j\sigma}$ is the number operator that counts the number of orbital 
excitations in the chain. Note that this ‘fermionization’ of the 1D problem was 
performed by replacing the standard orbital pseudospin operators with fermions 
by Jordan-Wigner transformation (see ref. 8 for details). 

The second part of the Hamiltonian in equation (1) corresponds to the spin 
dynamics on the bonds where the orbital excitation is not present: 

$$H_S = J \sum_{j} \mathbf{S}_j \cdot \mathbf{S}_{j+1}$$

Here $J = 4t_1^2/U$ is now the well-known Heisenberg antiferromagnetic superexchange 
constant. In sum, $H_S$ and $H_O$ constitute the Hamiltonian H from equation (1). 
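
To make the spin part concrete, the following minimal sketch finds the ground-state energy of a nearest-neighbour spin-1/2 Heisenberg ring by power iteration. It treats only the pure spin chain on a very small ring and is not the 28-site calculation with an orbital excitation described in this section. The function names, the bit encoding of the basis states and the choice of solver are assumptions made for illustration.

```python
import math


def apply_heisenberg(vec, n_sites, J=1.0):
    """Return H|vec> for H = J * sum_j S_j . S_{j+1} on a periodic chain.

    vec maps basis states (integers; bit j set means spin up on site j)
    to amplitudes.
    """
    out = {}
    for state, amp in vec.items():
        for j in range(n_sites):
            k = (j + 1) % n_sites
            if ((state >> j) & 1) == ((state >> k) & 1):
                # parallel spins: only the Sz.Sz term contributes (+J/4)
                out[state] = out.get(state, 0.0) + 0.25 * J * amp
            else:
                # antiparallel spins: Sz.Sz gives -J/4, the flip term gives J/2
                out[state] = out.get(state, 0.0) - 0.25 * J * amp
                flipped = state ^ ((1 << j) | (1 << k))
                out[flipped] = out.get(flipped, 0.0) + 0.5 * J * amp
    return out


def ground_state_energy(n_sites, J=1.0, n_iter=400):
    """Lowest eigenvalue via power iteration on (shift - H), for J > 0."""
    shift = 0.25 * J * n_sites + 1.0   # above the largest eigenvalue, J * n_sites / 4
    neel = sum(1 << j for j in range(0, n_sites, 2))  # up-down-up-down start vector
    vec = {neel: 1.0}
    for _ in range(n_iter):
        hv = apply_heisenberg(vec, n_sites, J)
        new = {s: shift * vec.get(s, 0.0) - hv.get(s, 0.0) for s in set(vec) | set(hv)}
        norm = math.sqrt(sum(a * a for a in new.values()))
        vec = {s: a / norm for s, a in new.items()}
    hv = apply_heisenberg(vec, n_sites, J)
    return sum(vec[s] * hv.get(s, 0.0) for s in vec)   # Rayleigh quotient


if __name__ == "__main__":
    print(ground_state_energy(4))   # four-site ring, exact value is -2 J
```

Power iteration is chosen only because it needs no external libraries. For chains as long as 28 sites a sparse Lanczos-type solver would be the more usual choice.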

Excitations from the ground-state 3d x² − y² orbital to either the xy orbital or 
the xz orbital are described by H, but with different effective parameters because 
the hopping amplitudes from each of these orbitals to the neighbouring bonding 
oxygen orbitals and, subsequently, to the neighbouring copper sites differ. In 
particular, for the xy orbital the effective hopping amplitude along the chain is 
~25% smaller than that for the xz orbital owing to the formation of bonding and 
antibonding states with 2p orbitals on oxygen atoms outside the copper-oxygen 
chain. For orbital excitations involving the yz and 3z² − r² orbitals, the dispersion 
vanishes because the hopping matrix elements to the neighbouring oxygen orbitals 
along the chain are either much smaller than for the xy or xz orbital excitations 
(3z² − r² orbital) or are vanishingly small (yz orbital). 
RIXS cross-section. Local effective RIXS operators have been derived for the 
Cu²⁺ (3d⁹) electronic configuration (see also Supplementary Information, 
section 3). However, several simplifications beyond the local effective scattering 
operators as used in refs 12, 13 and 30 are justified, because the spin of the particle in 
the excited orbital is to a good approximation conserved during orbiton 
propagation (Hund’s exchange is one order of magnitude smaller than the on-site 
Coulomb repulsion). First, the spectrum obtained is independent of the spin of the 
particle in the excited orbital. Second, the spin and the orbital character of the excited 
particle are conserved during propagation. The RIXS intensity for an orbital 
excitation is therefore given by the sum of the intensities for scattering from initial 
states with spin up or down to final states with the spin either flipped or conserved. 
This allows us to express the RIXS intensity as the product of polarization-dependent 
intensities and the non-local dynamical structure factor O(q, ω), which 
is defined as the spectral function of the single orbital excitation created using 
operator $c_{j\sigma}$ and propagating with Hamiltonian H (ref. 8). This structure factor is 
either equal to one, for ‘non-dispersive’ orbital excitations (3z² − r² and yz orbitals; 
see previous section), or is calculated numerically by exactly solving the 
Hamiltonian H on a finite chain of 28 sites. The latter is done separately for xz 
and xy orbital excitations. We note that such a separation is possible because 
orbiton-orbiton interactions vanish for one-orbiton excitations in the chain. 
The results of these calculations are presented in Fig. 4c. 


30. van Veenendaal, M. Polarization dependence of L- and M-edge resonant inelastic 
X-ray scattering in transition-metal compounds. Phys. Rev. Lett. 96, 117404 (2006). 
The genomic and transcriptomic 
architecture of 2,000 breast tumours 
reveals novel subgroups 


Christina Curtis, Sohrab P. Shah, Suet-Feung Chin, Gulisa Turashvili, Oscar M. Rueda, Mark J. Dunning, 
Doug Speed, Andy G. Lynch, Shamith Samarajiwa, Yinyin Yuan, Stefan Gräf, Gavin Ha, Gholamreza Haffari, 
Ali Bashashati, Roslin Russell, Steven McKinney, METABRIC Group, Anita Langerød, Andrew Green, Elena Provenzano, 
Gordon Wishart, Sarah Pinder, Peter Watson, Florian Markowetz, Leigh Murphy, Ian Ellis, Arnie Purushotham, 
Anne-Lise Børresen-Dale, James D. Brenton, Simon Tavaré, Carlos Caldas & Samuel Aparicio 


The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and 
transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene 
expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical 
follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy 
number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- 
and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer 
genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA-RNA profiles revealed 
novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, 
oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. 
Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR 
deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 
deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer 
population, derived from the impact of somatic CNAs on the transcriptome. 


Inherited genetic variation and acquired genomic aberrations contrib- 
ute to breast cancer initiation and progression. Although somatically 
acquired CNAs are the dominant feature of sporadic breast cancers, the 
driver events that are selected for during tumorigenesis are difficult to 
elucidate as they co-occur alongside a much larger landscape of random 
non-pathogenic passenger alterations and germline copy number 
variants (CNVs). Attempts to define subtypes of breast cancer and to 
discern possible somatic drivers are still in their relative infancy, in 
part because breast cancer represents multiple diseases, implying that 
large numbers (many hundreds or thousands) of patients must be 
studied. Here we describe an integrated genomic/transcriptomic 
analysis of breast cancers with long-term clinical outcomes composed 
of a discovery set of 997 primary tumours and a validation set of 995 
tumours from METABRIC (Molecular Taxonomy of Breast Cancer 
International Consortium). 


A breast cancer population genomic resource 


We assembled a collection of over 2,000 clinically annotated primary 
fresh-frozen breast cancer specimens from tumour banks in the UK 
and Canada (Supplementary Tables 1-3). Nearly all oestrogen receptor 
(ER)-positive and/or lymph node (LN)-negative patients did not receive 
chemotherapy, whereas ER-negative and LN-positive patients did. 
Additionally, none of the HER2+ patients received trastuzumab. As such, 
the treatments were homogeneous with respect to clinically relevant 
groupings. An initial set of 997 tumours was analysed as a discovery group 
and a further set of 995 tumours, for which complete data later became 
available, was used to test the reproducibility of the integrative clusters 
(described below). An overview of the main analytical approaches is 
provided in Supplementary Fig. 1. Details concerning expression and 
copy number profiling, including sample assignment to the PAM50 
intrinsic subtypes (Supplementary Fig. 2), copy number analysis 
(Supplementary Tables 4-8) and validation (Supplementary Figs 3 and 
4 and Supplementary Tables 9-11), and TP53 mutational profiling 
(Supplementary Fig. 5) are described in the Supplementary Information. 


Genome variation affects tumour expression architecture 


Genomic variants are considered to act in cis when a variant at a locus 
has an impact on its own expression, or in trans when it is associated 
with genes at other sites in the genome. We generated a map of CNAs, 
CNVs (Supplementary Fig. 6, Supplementary Tables 12-15) and 
single nucleotide polymorphisms (SNPs) in the breast cancer genome 
to distinguish germline from somatic variants (see Methods), and 
to examine the impact of each of these variants on the expression 
landscape. Previous studies have shown that most heritable gene 
expression traits are governed by a combination of cis (proximal) loci, 
defined here as those within a 3-megabase (Mb) window surrounding 
the gene of interest, and trans (distal) loci, defined here as those 
outside that window. We assessed the relative influence of SNPs, 
CNVs and CNAs on tumour expression architecture, using each of 
these variants as a predictor (see Methods) to elucidate expression 
quantitative trait loci (eQTLs) among patients. 
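
The cis/trans definition used here can be written as a small helper. The sketch assumes that the 3-Mb window is a total width centred on the gene midpoint and that a variant on another chromosome is always trans. Both choices, the function name and the example coordinates are illustrative assumptions, so adjust `window_bp` if the window is meant per side.

```python
def classify_locus(variant_chrom, variant_pos, gene_chrom, gene_start, gene_end,
                   window_bp=3_000_000):
    """Label a variant-gene pair as 'cis' or 'trans'.

    Assumption: the window has total width window_bp and is centred on
    the midpoint of the gene; positions are base-pair coordinates.
    """
    if variant_chrom != gene_chrom:
        return "trans"
    midpoint = (gene_start + gene_end) / 2
    return "cis" if abs(variant_pos - midpoint) <= window_bp / 2 else "trans"


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only
    print(classify_locus("17", 37_000_000, "17", 37_800_000, 37_900_000))  # cis
    print(classify_locus("17", 30_000_000, "17", 37_800_000, 37_900_000))  # trans
    print(classify_locus("8", 37_850_000, "17", 37_800_000, 37_900_000))   # trans
```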

Both germline variants and somatic aberrations were found to 
influence tumour expression architecture, having an impact on 
>39% (11,198/28,609) of expression probes genome-wide based on 
analysis of variance (ANOVA; see Methods), with roughly equal 
numbers of genes associated in cis and trans. CNAs were associated 
with the greatest number of expression profiles (Fig. 1, Supplementary 
Figs 7-13 and Supplementary Tables 16-20), but were rivalled by 
SNPs to explain a greater proportion of expression variation on a 
per-gene basis genome-wide, whereas the contribution from CNVs 
was more moderate (Fig. 1b and Supplementary Table 21). The true 
ratio of putative trans versus cis eQTLs is hard to estimate; however, 
the large sample size used here allowed the detection of small effects, 
with 5,401 and 5,462 CNAs significantly (Sidak adjusted P value 
<0.0001) associated in cis or in trans, respectively. Whereas cis- 
associations tended to be stronger, the trans-acting loci modulated 
a larger number of messenger RNAs, as described below. 
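
For reference, the Šidák adjustment for m tests is p_adj = 1 − (1 − p)^m. The sketch below implements that textbook formula and its inverse. The number of tests and the exact way the adjustment was applied are specified in the paper's Methods, not here, so the function names and the example numbers are illustrative only.

```python
import math


def sidak_adjust(p_value, n_tests):
    """Sidak-adjusted P value, 1 - (1 - p)^m, computed stably for tiny p."""
    if p_value >= 1.0:
        return 1.0
    return -math.expm1(n_tests * math.log1p(-p_value))


def sidak_threshold(alpha, n_tests):
    """Raw P-value cutoff whose Sidak-adjusted value equals alpha."""
    return -math.expm1(math.log1p(-alpha) / n_tests)


if __name__ == "__main__":
    m = 1000                       # hypothetical number of tests
    cutoff = sidak_threshold(1e-4, m)
    print(f"raw P must be below {cutoff:.3e} to reach adjusted P < 0.0001")
```

Using `log1p` and `expm1` avoids the loss of precision that a direct `1 - (1 - p) ** m` suffers when p is very small.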


Expression outliers refine the breast cancer landscape 


As shown above, ~20% of loci exhibit CNA-expression associations in 
cis (Supplementary Fig. 14). To refine this landscape further and identify 
the putative driver genes, we used profiles of outlying expression (see 
Methods and ref. 10) and the high resolution and sensitivity of the 
Affymetrix SNP 6.0 platform to delineate candidate regions. This 
approach markedly reduces the complexity of the landscape to 45 regions 
(frequency > 5, Fig. 2) and narrows the focus, highlighting novel regions 
that modulate expression. The full enumeration of regions delineated by 
this approach and their subtype-specific associations (Supplementary 
Figs 15 and 16 and Supplementary Tables 22-24) includes both known 
drivers (for example, ZNF703 (ref. 11), PTEN (ref. 12), MYC, CCND1, 
MDM2, ERBB2, CCNE1 (ref. 13)) and putative driver aberrations (for 
example, MDM1, MDM4, CDK3, CDK4, CAMK1D, PI4KB, NCOR1). 

The deletion landscape of breast cancer has been poorly explored, 
with the exception of PTEN. We illustrate three additional regions of 
significance centred on PPP2R2A (8p21, Fig. 2, region 11), MTAP 
(9p21, Fig. 2, region 15) and MAP2K4 (17p11, Fig. 2, region 33), 
which exhibit heterozygous and homozygous deletions (Supplemen- 
tary Figs 15, 17-19 and Supplementary Table 24) that drive expres- 
sion of these loci. We observe breast cancer subtype-specific (enriched 
in mitotic ER-positive cancers) loss of transcript expression in 
PPP2R2A, a B-regulatory subunit of the PP2A mitotic exit holoenzyme 
complex. Somatic mutations in PPP2R1A have recently been reported 
in clear cell ovarian cancers and endometrioid cancers, and 
methylation silencing of PPP2R2B has also been observed in colorectal 
cancers. Thus, dysregulation of specific PPP2R2A functions in luminal 
B breast cancers adds a significant pathophysiology to this subtype. 

MTAP (9p21, a component of methyladenosine salvage) is fre- 
quently co-deleted with the CDKN2A and CDKN2B tumour suppressor 
genes in a variety of cancers as we observe here (Supplementary Figs 
17c and 18). The third deletion encompasses MAP2K4 (also called 
MKK4) (17p11), a p38/Jun dual specificity serine/threonine protein 
kinase. MAP2K4 has been proposed as a recessive cancer gene, with 
mutations noted in cell lines. We show, for the first time, the recurrent 
deletion of MAP2K4 (Supplementary Figs 17d and 19) concomitant 
with outlying expression (Supplementary Fig. 15) in predominantly 
ER-positive cases, and verify homozygous deletions (Supplementary 
Table 9) in primary tumours, strengthening the evidence for MAP2K4 
as a tumour suppressor in breast cancer. 
Figure 1 | Germline and somatic variants influence tumour expression 
architecture. a, Venn diagrams depict the relative contribution of SNPs, CNVs 
and CNAs to genome-wide, cis and trans tumour expression variation for 
significant expression associations (Sidak adjusted P-value <0.0001). 
b, Histograms illustrate the proportion of variance explained by the most 
significantly associated predictor for each predictor type, where several of the 
top associations are indicated. 
Figure 2 | Patterns of cis outlying expression refine putative breast cancer 
drivers. A genome-wide view of outlying expression coincident with extreme 
copy number events in the CNA landscape highlights putative driver genes, as 
indicated by the arrows and numbered regions. The frequency (absolute count) 
of cases exhibiting an outlying expression profile at regions across the genome is 


Trans-acting associations reveal distinct modules 


We next asked how trans-associated expression profiles are distributed 
across the genome. We mapped these in the expression landscape by 
examining the matrices of CNA-expression associations (see Methods). 
This revealed strong off-diagonal patterns at loci on chromosomes 1q, 
7p, 8, 11q, 14q, 16, 17q and 20q (Fig. 3a), including both positive and 
negative associations, as well as numerous trans-acting aberra- 
tion hotspots (defined as CNAs associated with >30 mRNAs). 
Importantly, these aberration hotspots can be grouped into pathway 
modules, which highlight known driver loci such as ERBB2 and MYC, 
as well as novel loci associated with large trans expression modules 
(Supplementary Tables 25 and 26). The T-cell-receptor (TCR) loci on 
chromosomes 7 (TRG) and 14 (TRA) represent two such hotspots that 
modulated 381 and 153 unique mRNAs, respectively, as well as 19 
dually regulated genes (Supplementary Fig. 20). These cognate 
mRNAs were highly enriched for T-cell activation and proliferation, 
dendritic cell presentation, and leukocyte activation, which indicate 
the induction of an adaptive immune response associated with 
tumour-infiltrating lymphocytes (Fig. 3b, Supplementary Fig. 20 and 
Supplementary Tables 27 and 28), as described later. 
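
The hotspot definition (a CNA associated in trans with more than 30 mRNAs) amounts to counting distinct targets per locus. A minimal sketch, assuming the significant trans associations are already available as (CNA locus, mRNA) pairs; the input layout, the function name and the toy gene identifiers are hypothetical:

```python
from collections import defaultdict


def find_trans_hotspots(associations, min_mrnas=30):
    """Return {locus: number of distinct mRNAs} for loci above the cutoff.

    associations: iterable of (cna_locus, mrna_id) pairs that were already
    called significant and trans-acting upstream of this step.
    """
    targets = defaultdict(set)
    for locus, mrna in associations:
        targets[locus].add(mrna)
    return {locus: len(mrnas) for locus, mrnas in targets.items()
            if len(mrnas) > min_mrnas}


if __name__ == "__main__":
    toy = [("TRA", f"gene{i}") for i in range(31)]       # 31 targets: hotspot
    toy += [("other", f"gene{i}") for i in range(30)]    # 30 targets: not a hotspot
    print(find_trans_hotspots(toy))
```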

In a second approach, we examined the genome-wide patterns of 
linear correlation between copy number and expression features (see 
Methods), and noted the alignment of several off-diagonal signals, 
including those on chromosome 1q, 8q, 11q, 14q and 16 (Sup- 
plementary Fig. 21). Additionally, a broad signal on chromosome 5 
localizing to a deletion event restricted to the basal-like tumours was 
observed (Supplementary Fig. 21), but was not detected with the 
eQTL framework, where discrete (as opposed to continuous) copy 
number values were used. This basal-specific trans module is enriched 
for transcriptional changes involving cell cycle, DNA damage repair 
and apoptosis (Supplementary Table 29), reflecting the high mitotic 
index typically associated with basal-like tumours, described in detail 
below. 


shown, as is the distribution across subgroups for several regions in the insets. 
High-level amplifications are indicated in red and homozygous deletions in 
blue. Red asterisks above the bar plots indicate significantly different observed 
distributions than expected based on the overall population frequency (χ² test, 
P < 0.0001). 


Integrative clustering reveals novel subgroups 


Using the discovery set of 997 breast cancers, we next asked whether 
novel biological subgroups could be found by joint clustering of copy 
number and gene expression data. On the basis of our finding that cis- 
acting CNAs dominated the expression landscape, the top 1,000 cis- 
associated genes across all subtypes (Supplementary Table 30) were 
used as features for input to a joint latent variable framework for 
integrative clustering (see Methods). Cluster analysis suggested 10 
groups (based on Dunn’s index) (see Methods and Supplementary Figs 
22 and 23), but for completeness, this result was compared with the 
results for alternative numbers of clusters and clustering schemes (see 
Methods, Supplementary Figs 23-27 and Supplementary Tables 31- 
33). The 10 integrative clusters (labelled IntClust 1-10) were typified 
by well-defined copy number aberrations (Fig. 4, Supplementary Figs 
22, 28-30 and Supplementary Tables 34-39), and split many of the 
intrinsic subtypes (Supplementary Figs 31-33). Kaplan-Meier plots of 
disease-specific survival and Cox proportional hazards models indicate 
subgroups with distinct clinical outcomes (Fig. 5, Supplementary Figs 
34, 35 and Supplementary Tables 40 and 41). To validate these results, 
we trained a classifier (754 features) for the integrative subtypes in the 
discovery set using the nearest shrunken centroids approach (see 
Methods and Supplementary Tables 42 and 43), and then classified 
the independent validation set of 995 cases into the 10 groups 
(Supplementary Table 44). The reproducibility of the clusters in the 
validation set is shown in three ways. First, classification of the valid- 
ation set resulted in the assignment of a similar proportion of cases to 
the 10 subgroups, each of which exhibited nearly identical copy number 
profiles (Fig. 4). Second, the groups have substantially similar hazard 
ratios (Fig. 5b, Supplementary Fig. 35 and Supplementary Table 40). 
Third, the quality of the clusters in the validation set is emphasized by 
the in-group proportions (IGP) measure (Fig. 4). 
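
Dunn's index, used above to choose the number of clusters, is in its classical form the smallest distance between points in different clusters divided by the largest distance between points in the same cluster; larger values indicate compact, well-separated clusters. The sketch below implements that classical definition with Euclidean distance on toy points. The paper's Methods may use a different variant or distance, and the function name is an assumption.

```python
import math
from itertools import combinations


def dunn_index(points, labels):
    """Classical Dunn index: min inter-cluster distance / max cluster diameter."""
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    # Largest within-cluster pairwise distance (cluster diameter)
    max_diameter = max(
        (math.dist(a, b)
         for members in clusters.values()
         for a, b in combinations(members, 2)),
        default=0.0,
    )
    if max_diameter == 0.0:
        raise ValueError("all clusters are single points; index undefined")

    # Smallest distance between points belonging to different clusters
    min_separation = min(
        math.dist(a, b)
        for first, second in combinations(clusters.values(), 2)
        for a in first
        for b in second
    )
    return min_separation / max_diameter


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (5, 0), (5, 1)]
    print(dunn_index(pts, [0, 0, 1, 1]))   # well separated grouping
    print(dunn_index(pts, [0, 1, 0, 1]))   # poor grouping gives a lower value
```

Comparing the index across candidate numbers of clusters, as described in the text, would mean calling this once per clustering and keeping the solution with the highest value.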

Among the integrative clusters, we first note an ER-positive sub- 
group composed of 11q13/14 cis-acting luminal tumours (IntClust 2, 
Figure 3 | Trans-acting aberration hotspots modulate concerted molecular 
pathways. a, Manhattan plot illustrating cis and trans expression-associated 
copy number aberrations from the eQTL analysis (top panel). The matrix of 
significant predictor-expression associations (adjusted P-value ≤0.0001) 
exhibits strong off-diagonal patterns (middle panel), and the frequency of 
mRNAs associated with a particular copy number aberration further illuminates 
these trans-acting aberration hotspots (bottom panel). The directionality of the 
associations is indicated as follows: cis: positive, red; negative, pink; trans: 
positive, blue; negative, green. b, Enrichment map of immune response modules 
in the trans-associated TRA network, where letters in parentheses represent the 
source database as follows: b, NCI-PID BioCarta; c, cancer cell map; k, KEGG; 
n, NCI-PID curated pathways; p, PANTHER; r, Reactome. 


n=45) that harbour other common alterations. This subgroup 
exhibited a steep mortality trajectory with elevated hazard ratios 
(discovery set: 3.620, 95% confidence interval (1.905-6.878); valid- 
ation set: 3.353, 95% confidence interval (1.381-8.141)), indicating 
that it represents a particularly high-risk subgroup. Several known 
and putative driver genes reside in this region, namely CCND1 
(11q13.3), EMSY (11q13.5), PAK1 (11q14.1) and RSF1 (11q14.1), 
which have been previously linked to breast or ovarian cancer. 
Both the copy number (Fig. 4) and expression outlier landscapes 
(Fig. 2) suggest at least two separate amplicons at 11q13/14, one at 
CCND1 (11q13.3) and a separate peak from 11q13.5-11q14.1 spanning 
UVRAG-GAB2, centred around PAK1, RSF1, C11orf67 and INTS4, 
where it is more challenging to distinguish the driver. Notably, the 
expression outlier profiles for this region are enriched for samples 
belonging to IntClust 2 (Fig. 2, inset region 23) and all 45 members 
of this subgroup harboured amplifications of these genes, with high 
frequencies of amplification also observed for CCND1 (n = 39) and 
EMSY (n = 34). In light of these observations, the 11q13/14 amplicon 
may be driven by a cassette of genes rather than a single oncogene. 

Second, we note the existence of two subgroups marked by a paucity 
of copy number and cis-acting alterations. These subgroups cannot be 
explained by low cellularity tumours (see Methods). One subgroup 
(IntClust 3, n = 156) with low genomic instability (Fig. 4 and Sup- 
plementary Fig. 22) was composed predominantly of luminal A cases, 
and was enriched for histotypes that typically have good prognosis, 
including invasive lobular and tubular carcinomas. The other sub- 
group (IntClust 4, n = 167) was also composed of favourable outcome 
cases, but included both ER-positive and ER-negative cases and varied 
intrinsic subtypes, and had an essentially flat copy number landscape, 
hence termed the ‘CNA-devoid’ subgroup. A significant proportion of 
cases within this subgroup exhibit extensive lymphocytic infiltration 
(Supplementary Table 45). 

Third, several intermediate prognosis groups of predominantly 
ER-positive cancers were identified, including a 17q23/20q cis-acting 
luminal B subgroup (IntClust 1, n = 76), an 8p12 cis-acting luminal 
subgroup (IntClust 6, n= 44), as well as an 8q cis-acting/20q- 
amplified mixed subgroup (IntClust 9, n = 67). Two luminal A sub- 
groups with similar CNA profiles and favourable outcome were 
noted. One subgroup is characterized by the classical 1q gain/16q loss 
(IntClust 8, n = 143), which corresponds to a common translocation 
event, and the other lacks the 1q alteration, while maintaining the 
16p gain/16q loss with higher frequencies of 8q amplification 
(IntClust 7, n = 109). We also noted that the majority of basal-like 
tumours formed a stable, mostly high-genomic instability subgroup 
(IntClust 10, n = 96). This subgroup had relatively good long-term 
outcomes (after 5 years), consistent with ref. 26, and characteristic cis- 
acting alterations (5 loss/8q gain/10p gain/12p gain). 

The ERBB2-amplified cancers composed of HER2-enriched (ER- 
negative) cases and luminal (ER-positive) cases appear as IntClust 5 
(n = 94), thus refining the ERBB2 intrinsic subtype by grouping addi- 
tional patients that might benefit from targeted therapy. Patients in 
this study were enrolled before the general availability of trastuzumab, 
and as expected this subgroup exhibits the worst disease-specific sur- 
vival at both 5 and 15 years and elevated hazard ratios (discovery set: 
3.899, 95% confidence interval (2.234-6.804); validation set: 4.447, 
95% confidence interval (2.284-8.661)). 
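
The hazard ratios and confidence intervals quoted for IntClust 2 and IntClust 5 can be sanity-checked. If an interval is symmetric on the log scale, as is usual for Wald intervals from a Cox model (an assumption here, since the text does not state how the intervals were computed), the point estimate equals the geometric mean of the two bounds. The helper names below are hypothetical; the numbers are those quoted in the text.

```python
import math

Z_95 = 1.959964  # two-sided 95% quantile of the standard normal


def implied_point_estimate(lower, upper):
    """Geometric mean of the bounds; equals the HR if the CI is symmetric in log(HR)."""
    return math.sqrt(lower * upper)


def implied_log_se(lower, upper, z=Z_95):
    """Standard error of log(HR) implied by a log-symmetric interval."""
    return (math.log(upper) - math.log(lower)) / (2.0 * z)


# (reported hazard ratio, lower bound, upper bound) as quoted in the text
REPORTED = {
    "IntClust 2, discovery": (3.620, 1.905, 6.878),
    "IntClust 2, validation": (3.353, 1.381, 8.141),
    "IntClust 5, discovery": (3.899, 2.234, 6.804),
    "IntClust 5, validation": (4.447, 2.284, 8.661),
}

if __name__ == "__main__":
    for label, (hr, lower, upper) in REPORTED.items():
        estimate = implied_point_estimate(lower, upper)
        se = implied_log_se(lower, upper)
        print(f"{label}: reported {hr:.3f}, implied {estimate:.3f}, SE(log HR) {se:.3f}")
```

All four reported hazard ratios agree with the geometric mean of their bounds to within rounding, which is what log-symmetric intervals would give. The wider validation-set intervals correspond to larger implied standard errors.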


Pathway deregulation in the integrative subgroups 


Finally, we projected the molecular profiles of the integrative sub- 
groups onto pathways to examine possible biological themes among 
breast cancer subgroups (Supplementary Tables 46 and 47) and the 
relative impact of cis and trans expression modules on the pathways. 
The CNA-devoid (IntClust 4) group exhibits a strong immune and 
inflammation signature involving the antigen presentation pathway, 
OX40 signalling, and cytotoxic T-lymphocyte-mediated apoptosis 
(Supplementary Fig. 36). Given that trans-acting deletion hotspots 
were localized to the TRG and TRA loci and were associated with 
an adaptive immune response module, we asked whether these dele- 
tions contribute to alterations in this pathway. The CNA-devoid sub- 
group (IntClust 4) was found to exhibit nearly twice as many deletions 
(typically heterozygous loss) at the TRG and TRA loci (~20% of cases) 
as compared to the other subtypes (with the exception of IntClust 10), 
and deletions of both TCR loci were significantly associated with 
severe lymphocytic infiltration (χ² test, P<10? and P<10 °, 
respectively). Notably, these trans-associated mRNAs were signifi- 
cantly enriched in the immune response signature of the CNA-devoid 
subgroup (Supplementary Fig. 36) as well as among genes differentially 
expressed in CNA-devoid cases with severe lymphocytic infiltration 
(Supplementary Fig. 37). We conclude that genomic copy number loss 
Figure 4 | The integrative subgroups have distinct copy number profiles. 
Genome-wide frequencies (F, proportion of cases) of somatic CNAs (y-axis, 
upper plot) and the subtype-specific association (−log10 P-value) of aberrations 
(y-axis, bottom plot) based on a χ² test of independence are shown for each of 
the 10 integrative clusters. Regions of copy number gain are indicated in red 
and regions of loss in blue in the frequency plot (upper plot). Subgroups were 


at the TCR loci drives a trans-acting immune response module that 
associates with lymphocytic infiltration, and characterizes an otherwise 
genomically quiescent subgroup of ER-positive and ER-negative 
patients with good prognosis. These observations suggest the presence 
of mature T lymphocytes (with rearranged TCR loci), which may 
explain an immunological response to the cancer. In line with these 
findings, a recent study demonstrated the association between CD8+ 
lymphocytes and favourable prognosis. 
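
The association test reported above can be illustrated with a short sketch. It computes a Pearson χ² test of independence on a 2 × 2 table, assuming no continuity correction; the counts are hypothetical and are not taken from the study, and the function name is only for illustration.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square test of independence for a 2x2 table.

    Assumes no continuity correction. Returns (statistic, p_value).
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    chi2 = 0.0
    for i, observed_row in enumerate(table):
        for j, observed in enumerate(observed_row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # With one degree of freedom the chi-square survival function
    # reduces to the complementary error function.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical counts (NOT from the paper):
# rows = TCR locus deletion present / absent,
# columns = severe lymphocytic infiltration yes / no.
counts = [[30, 70], [40, 360]]
chi2, p = chi_square_2x2(counts)
print(f"chi2 = {chi2:.2f}, p = {p:.2e}")
```

A small P-value here would only indicate that deletion status and infiltration are not independent in the table; it says nothing about the direction of causation.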

Also among the trans-influenced groups is IntClust 10 (basal-like 
cancer enriched subgroup), which harbours chromosome 5q dele- 
tions (Supplementary Fig. 21). Numerous signalling molecules, tran- 
scription factors and cell division genes were associated in trans with 
this deletion event in the basal cancers, including alterations in 
AURKB, BCL2, BUB1, CDCA3, CDCA4, CDC20, CDC45, CHEK1, 
FOXM1, HDAC2, IGF1R, KIF2C, KIFC1, MTHFD1L, RAD51AP1, 
TTK and UBE2C (Supplementary Fig. 38). Notably, TTK (MPS1), a 
dual specificity kinase that assists AURKB in chromosome alignment 
during mitosis, and recently reported to promote aneuploidy in breast 
cancer, was upregulated. These results indicate that 5q deletions 
modulate the coordinate transcriptional control of genomic and 
chromosomal instability and cell cycle regulation within this subgroup. 

In contrast to these subtype-specific trans-associated signatures, 
the high-risk 11q13/14 subgroup was characterized by strong 

ordered by hierarchical clustering of their copy number profiles in the discovery 
cohort (n = 997). For the validation cohort (n = 995), samples were classified 
into each of the integrative clusters as described in the text. The number of cases 
in each subgroup (n) is indicated as is the in-group proportion (IGP) and 
associated P-value, as well as the distribution of PAM50 subtypes within each 
cluster. 


cis-acting associations. Like the basal cancers, this subgroup also 
exhibited alterations in key cell-cycle-related genes (Supplementary 
Fig. 39), which probably have a role in its aggressive pathophysiology, 
but the nature of the signature differs. In particular, the regulation of 
the G1/S transition by BTG family proteins, which include CCND1, 
PPP2R1B and E2F2, was significantly enriched in the 11q13/14 cis-acting 
subgroup, but not the basal cancers, and this is consistent with 
CCND1 and the PPP2R subunit representing subtype-specific drivers 
in these tumours. 


Discussion 

We have generated a robust, population-based molecular subgroup- 
ing of breast cancer based on multiple genomic views. The size and 
nature of this cohort made it amenable to eQTL analyses, which can 
aid the identification of loci that contribute to the disease phenotype. 
CNAs and SNPs influenced expression variation, with CNAs 
dominating the landscape in cis and trans. The joint clustering of 
CNAs and gene expression profiles further resolves the considerable 
heterogeneity of the expression-only subgroups, and highlights a 
high-risk 11q13/14 cis-acting subgroup as well as several other strong 
cis-acting clusters and a genomically quiescent group. The reproducibility 
of subgroups with these molecular and clinical features in a validation 
cohort of 995 tumours suggests that by integrating multiple genomic 

Figure 5 | The integrative subgroups have distinct clinical outcomes. 

a, Kaplan-Meier plot of disease-specific survival (truncated at 15 years) for the 
integrative subgroups in the discovery cohort. For each cluster, the number of 
samples at risk is indicated as well as the total number of deaths (in 
parentheses). b, 95% confidence intervals for the Cox proportional hazard 
ratios are illustrated for the discovery and validation cohort for selected values 
of key covariates, where each subgroup was compared against IntClust 3. 


features it may be possible to derive more robust patient classifiers. We 
show here, for the first time, that subtype-specific trans-acting aberra- 
tions modulate concerted transcriptional changes, such as the TCR 
deletion-mediated adaptive immune response that characterizes the 
CNA-devoid subgroup and the chromosome 5 deletion-associated cell 
cycle program in the basal cancers. 

The integrated CNA-expression landscape highlights a limited 
number of genomic regions that probably contain driver genes, 
including ZNF703, which we recently described as a luminal B specific 
driver, as well as somatic deletion events affecting key subunits of the 
PP2A holoenzyme complex and MTAP, which have previously been 
under-explored in breast cancer. The CNA-expression landscape also 
illuminates rare but potentially significant events, including IGF1R, 
KRAS and EGFR amplifications and CDKN2B, BRCA2, RB1, ATM, 
SMAD4, NCOR1 and UTX homozygous deletions. Although some of 
these events have low overall frequencies (<1% patients) (Figs 2, 
Supplementary Fig. 15 and Supplementary Tables 22-24), they may 
have implications for understanding therapeutic responses to targeted 
agents, particularly those targeting tyrosine kinases or phosphatases. 

Finally, because the integrative subgroups occur at different 
frequencies in the overall population, focusing sequencing efforts 
on representative numbers from these groups will help to establish 
a comprehensive breast cancer somatic landscape at sequence-level 
resolution. For example, a significant number (~17%, n = 167 in the 
discovery cohort) of breast cancers are devoid of somatic CNAs, and 
are ripe for mutational profiling. Our work provides a definitive 
framework for understanding how gene copy number aberrations 
affect gene expression in breast cancer and reveals novel subgroups 
that should be the target of future investigation. 


METHODS SUMMARY 


All patient specimens were obtained with appropriate consent from the relevant 
institutional review board. DNA and RNA were isolated from samples and 
hybridized to the Affymetrix SNP 6.0 and Illumina HT-12 v3 platforms for 
genomic and transcriptional profiling, respectively. A detailed description of 
the experimental assays and analytical methods used to analyse these data are 
available in the Supplementary Information. 

Live-cell delamination counterbalances epithelial 
growth to limit tissue overcrowding 


Eliana Marinari, Aida Mehonic, Scott Curran, Jonathan Gale, Thomas Duke & Buzz Baum 


The development and maintenance of an epithelium requires 
finely balanced rates of growth and cell death. However, the mech- 
anical and biochemical mechanisms that ensure proper feedback 
control of tissue growth, which when deregulated contribute to 
tumorigenesis, are poorly understood. Here we use the fly notum as 
a model system to identify a novel process of crowding-induced 
cell delamination that balances growth to ensure the development 
of well-ordered cell packing. In crowded regions of the tissue, 
a proportion of cells undergo a serial loss of cell-cell junctions 
and a progressive loss of apical area, before being squeezed out 
by their neighbours. This path of delamination is recapitulated 
by a simple computational model of epithelial mechanics, in which 
stochastic cell loss relieves overcrowding as the system tends 
towards equilibrium. We show that this process of delamination 
is mechanistically distinct from apoptosis-mediated cell extrusion 
and precedes the first signs of cell death. Overall, this analysis 
reveals a simple mechanism that buffers epithelia against variations 
in growth. Because live-cell delamination constitutes a mechanistic 
link between epithelial hyperplasia and cell invasion, this is likely to 
have important implications for our understanding of the early 
stages of cancer development. 

To better understand the role of epithelial cell dynamics and 
mechanics in the development of a well-ordered tissue, we studied the 
refinement of the fly notum from 12 h after pupariation (AP), as the 
tissue undergoes a final process of topological reorganization before 
terminal differentiation. The refinement of cell packing in this tissue 
occurs largely without concomitant changes in overall tissue form or 
size. Using the constitutive expression of E-cadherin-GFP to follow 
apical junctional dynamics (Supplementary Movie 1), we observed a 
significant level of regionalized epithelial cell delamination in the tissue. 
In the midline region, where cells have an anisotropic geometry at 12h 
AP, 32.5 ± 13.6% of cells underwent basal delamination, compared with 
0.5 ± 0.3% of cells in the surrounding tissue (means ± s.d., n = 4 pupae) 
(Fig. 1c). The pattern of delamination varied between animals, was not 
symmetric across the midline, and involved a variable proportion of 
cells leaving the tissue both before and after cell division (Fig. 1b). This 
rules out a deterministic role for cell lineage, position or developmental 
time in delamination, implying that it is a stochastic process. Before the 
onset of delamination, cells within the midline of the tissue were 2.7-fold 
longer than they were wide when measured along the anterior—posterior 
axis of the animal, despite their having similar apical areas 
(48.5 ± 14.2 µm² in the midline and 51.3 ± 12.7 µm² outside the 
midline). By 26h AP, however, midline cells were isotropic in form 
and could no longer be distinguished from those in the rest of the 
tissue (Fig. 1a). Regionalized delamination therefore accompanies the 
topological rearrangement as the tissue tends towards a uniform cell 
organization. This suggests a possible role for delamination in the 
refinement of this tissue before terminal differentiation. 

Although it is not currently known how epithelial growth is regulated 
during development or homeostasis, mechanical stresses generated as 

Figure 1 | Stochastic cell delamination from crowded regions of a tissue 
during normal development. a, E-cadherin-GFP notum at 14h and 26h AP. 
b, Cells delaminating between 14h and 26h AP were coloured according to 
their behaviour (see key). c, The percentage of cells delaminating in the midline 
(M) and outside the midline (OM), calculated as the ratio between the number 
of cells that delaminated in 11h and the total initial number of cells from four 
animals. d, Tissue expressing E-cadherin-GFP and His-RFP (to mark nuclei) 
along different planes in z. Midline nuclei are indicated by arrowheads and 
those outside the midline by arrows. e, Box plot showing the z-depth of nuclei 
centroids in M and OM. The median is represented by horizontal lines; the 75th 
and 25th centiles are at the top and bottom of the boxes, respectively. 

f, Quantification and kymograph analysis of average initial velocity of vertex 
displacement after laser cutting for early pupae (14-16 h AP). Images show the 
junction before the cut at 0s and the kymograph after the cut. Dotted lines 
indicate 10s after cutting. The average initial velocity does not correlate with 
junction length (Supplementary Fig. 2b-d). g, Quantification of the average 
initial velocity of vertex displacement of single junctions after laser ablation for 
late pupae (24-26 h AP). In f and g, n represents 15 cuts for 5 animals. All error 
bars indicate s.d. Scale bars, 10 µm. 

a consequence of cell proliferation have been proposed to have a function. 
Several lines of evidence suggest that cells across the developing 
notum are subject to regional differences in forces that could, through 
mechanical feedback, lead to local differences in cell behaviour. First, at 
14h AP, nuclei in the midline region are found at variable depths 
along the z axis (Fig. 1d, e), whereas nuclei outside the midline lie in 
the same confocal plane (Fig. 1d, e). At the same time, cell-cell junc- 
tions within the midline region seem convoluted, whereas those out- 
side the midline seem taut (Supplementary Fig. 1a). Furthermore, 
when junctions are cut with a diffraction limited laser, there is a 
threefold difference in the initial velocity of the junctional recoil in 
these two regions. Thus, the vertices of severed junctions outside the 
midline open with an initial velocity of 0.06 ± 0.02 µm s⁻¹, whereas 
vertices in the midline remain relatively unmoved (0.02 ± 0.03 µm s⁻¹; 
Fig. 1f and Supplementary Fig. 2a). In contrast, by 26h AP, when 
nuclear and junctional morphology seem homogeneous across the 
tissue (Supplementary Fig. 1b), rates of vertex displacement are 
similar within the midline (0.12 ± 0.06 µm s⁻¹) and outside it 
(0.12 ± 0.05 µm s⁻¹; Fig. 1g and Supplementary Fig. 2a). These differences 
in tissue mechanics were confirmed by using different patterns of 
laser-induced dissection (Supplementary Figs 3 and 4). Because local 
delamination rates correlate with local differences in mechanics across 
the tissue, delamination could relieve local overcrowding to help 
generate the regular cell packing and uniform tissue mechanics seen 
at the end of development. 
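
As an illustration of how an initial recoil velocity might be extracted from a kymograph, the sketch below fits a least-squares line to vertex displacement over the first seconds after a cut. The 2 s sampling interval matches the imaging rate given in the Methods; the 10 s fitting window and the displacement values are assumptions chosen for illustration and are not the authors' data or their exact procedure.

```python
def initial_recoil_velocity(times_s, displacements_um, window_s=10.0):
    """Least-squares slope (um per s) of displacement against time.

    Only samples with time <= window_s are used. The window length is an
    assumption for this sketch, not a value confirmed by the paper.
    """
    points = [(t, d) for t, d in zip(times_s, displacements_um) if t <= window_s]
    if len(points) < 2:
        raise ValueError("need at least two samples inside the window")
    n = len(points)
    mean_t = sum(t for t, _ in points) / n
    mean_d = sum(d for _, d in points) / n
    sxx = sum((t - mean_t) ** 2 for t, _ in points)
    sxy = sum((t - mean_t) * (d - mean_d) for t, d in points)
    return sxy / sxx

# Hypothetical vertex displacements (um), sampled every 2 s after the cut.
times = [0, 2, 4, 6, 8, 10]
taut_junction = [0.00, 0.12, 0.24, 0.36, 0.48, 0.60]   # e.g. outside the midline
slack_junction = [0.00, 0.04, 0.08, 0.12, 0.16, 0.20]  # e.g. within the midline

v_taut = initial_recoil_velocity(times, taut_junction)
v_slack = initial_recoil_velocity(times, slack_junction)
print(f"taut: {v_taut:.3f} um/s, slack: {v_slack:.3f} um/s")
```

A faster recoil is read as higher junctional tension before the cut, which is why the comparison between regions is informative.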

To test this hypothesis, we used RNA-mediated interference (RNAi) 
to determine whether rates of cell delamination are sensitive to 
changes in cell crowding. We began by silencing components of the 
phosphatidylinositol-3-OH kinase (PI(3)K) and Hippo pathways 
(Fig. 2a, b and Supplementary Fig. 5). These treatments increased cell 
growth without significantly increasing tissue area, resulting in an 
increase in epithelial height and a corresponding increase in the rate 
of tissue-wide cell delamination (Fig. 2a-d and Supplementary 
Movie 2). Rates of epithelial cell delamination were highest in the 
midline and in crowded epithelial folds that formed as the result of 
tissue buckling (Fig. 2g). Conversely, rates of midline cell delamination 
were significantly decreased when growth and crowding were inhibited 
(Fig. 2a-d, f and Supplementary Movies 3 and 4). Moreover, in tissue 
expressing high levels of Tsc1 and Tsc2 the decrease in growth was 
accompanied by a corresponding increase in junctional tension across 
the tissue, measured by vertex displacement after cutting (Fig. 2e), as 
expected if crowding affects tissue mechanics. This marked decrease in 
the rate of delamination seen in tissues with decreased growth was not 
a simple consequence of the accompanying decrease in cell division, 
because delamination continued unabated when cell division was 
directly inhibited through the depletion of Cdc25/String (Supplemen- 
tary Fig. 6 and Supplementary Movie 5). Furthermore, differences in 
the rate of delamination could not be explained by growth-induced 
changes in developmental timing, because similar results were seen 
when we normalized on the basis of the timing of P1 cell divisions 
(Supplementary Fig. 7). These data suggest that local tissue mechanics 
influence the tendency of cells to delaminate. 

To gain insight into the potential role of mechanics in driving cell 
delamination within a crowded epithelium, we developed a computa- 
tional model of the notum (details in Supplementary Information) in 
which the topological organization of cells changed over time as the 
result of junctional rearrangements while overall tissue area remained 
fixed (Figs 3a and 4a). First, we used the model to test the likely effects 
of tissue growth on dynamic cell organization, through the introduction 
of a variable crowding parameter γ, equal to the ratio of the area 
that cells would naturally prefer to occupy relative to the fixed tissue 
area (details in Supplementary Information). In simulations, delamination 
was directly correlated with crowding, such that high γ favoured 
delamination by forcing cells to compete for limited space (Fig. 3b). 
We then examined the effects of cell geometry on delamination by 
running simulations in which cells either had an isotropic shape or an 

Figure 2 | Delamination counter-balances tissue growth. a, Average rates of 
delamination (n = 3 pupae) for cells in the midline and outside midline in 
control animals, in tissues where growth was increased (p110 PI(3)K, PTEN 
RNAi) and in nota with decreased growth (p110 PI(3)K RNAi, Tsc1 and Tsc2). 
b, Cells delaminating were coloured according to their behaviour (see inset). 
c, xz axis of tissues stained with 4',6-diamidino-2-phenylindole and labelled 
phalloidin. d, Average tissue height (n = 20 different regions from 4 different 
pupae). e, Quantification of average initial velocity of vertex displacement after 
laser dissection of single junctions in early pupae for tissues expressing Tsc1 
and Tsc2, with n representing 15 cuts from 5 pupae. f, Midline geometry for 
nota at 14h AP. g, xy image planes for different z-slices. Delamination is most 
prevalent in tissue folds after p110 PI(3)K overexpression (coloured white in 
the z projection). All error bars indicate s.d. P values were calculated relative to 
the control. Scale bars, 10 µm. 


average initial aspect ratio of 2.7:1, to model cells of the notum midline. 
This showed that cellular anisotropy promoted delamination (Fig. 3c), 
which in turn homogenized cell packing (Figs 1a and 3c). Significantly, 
by combining the two factors, crowding and geometry, we were able to 
accurately replicate the behaviours of cells within the midline (with 
anisotropic geometry and increased crowding) and outside the midline 
(isotropic geometry and no growth-induced crowding) using a single 
set of fixed parameters (Fig. 3d). 
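
To make the two ingredients of the model concrete, the sketch below computes a crowding ratio (total preferred cell area divided by the fixed tissue area) and a per-cell work term built from area elasticity, junctional line tension and cortical contractility. The functional form is the one commonly used in vertex models and is an assumption here; the paper's exact expression is given in its Supplementary Information. All function names and parameter values are hypothetical.

```python
import math

def polygon_area(vertices):
    """Shoelace area of a polygon given as a list of (x, y) tuples."""
    total = 0.0
    for (x1, y1), (x2, y2) in zip(vertices, vertices[1:] + vertices[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0

def polygon_perimeter(vertices):
    """Sum of edge lengths around the polygon."""
    return sum(math.dist(p, q) for p, q in zip(vertices, vertices[1:] + vertices[:1]))

def crowding_parameter(n_cells, preferred_area, tissue_area):
    """Ratio of the area the cells would prefer to occupy to the fixed tissue area."""
    return n_cells * preferred_area / tissue_area

def cell_work(vertices, target_area, elasticity=1.0, line_tension=0.1, contractility=0.05):
    """Assumed vertex-model work term for one cell.

    area elasticity + line tension along the perimeter + perimeter contractility.
    Parameter values are placeholders, not fitted values.
    """
    area = polygon_area(vertices)
    perimeter = polygon_perimeter(vertices)
    return (0.5 * elasticity * (area - target_area) ** 2
            + line_tension * perimeter
            + 0.5 * contractility * perimeter ** 2)

square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
print(crowding_parameter(n_cells=100, preferred_area=2.0, tissue_area=100.0))
print(cell_work(square, target_area=2.0), cell_work(square, target_area=1.0))
```

In this form, raising the target area while the tissue area stays fixed increases the work stored in every cell, which is one way to picture why higher crowding favours the removal of cells.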

A closer examination of simulations revealed the typical path of cell 
delamination. Spontaneous fluctuations in the length of individual 
cell-cell junctions cause a subset of cells to suffer a stochastic serial 
loss of junctions through neighbour exchange events (Fig. 3e). The 
concomitant progressive loss in apical area results in the removal of 
these cells from the simulated tissue. In view of this computational 
analysis, when we examined the precise path of cell delamination in the 
developing fly notum we identified two mechanistically distinct paths 
of delamination. First, we identified a population of cells that leave the 
tissue over an extended and variable period of time as they undergo a 
consecutive series of neighbour exchange events, accompanied by a 

Figure 3 | Topological rearrangements drive a stochastic process of 
delamination in overcrowded tissues. a, Schematic diagram of the model of 
mechanical forces that act in the plane of the epithelium: junctional line tension 
(Λ), contractility of cortical acto-myosin (Γ) and limited cell compressibility 
(K) based on a target area (A₀). The tissue evolves by small stochastic 
displacements of the vertices, biased towards changes that decrease the work 
function, W. b, Cumulative proportion of cells delaminating from wild-type 
tissue (γ = 2) and a tissue with high (γ = 4) or low (γ = 1) crowding as a 
function of time, all three with anisotropic geometry. c, Still images showing 
initial and final cell configurations for the midline simulation (anisotropic); 
values of parameters correspond to moderate crowding (γ = 2 in both cases). 
d, Final proportions of cells delaminating in simulations of wild-type tissue 
within and outside the midline. The midline has anisotropic geometry and 
γ = 2; tissue outside the midline has isotropic geometry and γ = 1. e, An 
example showing the serial loss of junctions and the progressive loss of apical 
area that precedes cell extrusion. In b, c and d mean values and standard errors 
of the data are averaged over six simulations. Grey shades represent standard 
errors of the data. 


progressive loss of junctions and apical area (Fig. 4a, b). We term this 
type of delamination a D1 process. Once these cells had fewer than five 
neighbours and less than ~25% of their initial area, they were forced 
out of the tissue within a short and reproducible timescale (about 
10min). This final push was associated with the recruitment of a 
contractile myosin II ring that formed within neighbouring cells 
(Fig. 4g). When the actomyosin cable was cut, both cell delamination 
and delamination-induced cell death were prevented (Supplementary 
Fig. 8). In addition, we identified a second pathway, which we term a 
D2 process, by which cells delaminated from the epithelium over a 
period of about 50 min, as the result of a progressive loss of apical area, 
without a concomitant change in neighbour relationships (Fig. 4c). 
This D2 process resembled apoptotic cell extrusion as described previously 
in that it was associated with the formation of rosette-like 
intermediates and the loss of junctional E-cadherin (Fig. 4g and 

Figure 4 | Crowding-induced delamination is a cause not a consequence of 
cell death and is distinct from apoptosis-mediated extrusion. a, Changing 
neighbours by means of a T1 transition in the model and in vivo. b, A typical 
midline wild-type cell delaminating (yellow perimeter) and a plot to show the 
relative timing of the loss of apical area and neighbours as it leaves the tissue. 
Junctions disappearing between frames are indicated in red. c, A cell (yellow 
perimeter) delaminating from the midline in the wild-type tissue by means of 
rosette formation. d, A delaminating cell (yellow perimeter) after ultraviolet 
(UV) irradiation. e, Percentage midline delamination (n = 3 pupae) for control 
pupae and after DIAP1 overexpression. Error bars indicate s.d. f, Plots showing 
the area (A) of delaminating cells (n = 25) after they have lost two junctions (or 
one junction in the case of five-sided cells) as a fraction of their initial area (Aj). 
Cells above the dotted line delaminate by losing junctions; cells below the 
dotted line delaminate through the formation of rosette-like intermediates. 

g, Control and ultraviolet-irradiated tissues marked with E-cadherin-GFP and 
MyoII-Cherry (mCh). The cytoplasmic myosin II ring is indicated with arrows. 
In the ultraviolet-irradiated tissue the arrowheads indicate E-cadherin loss and 
junctional myosin II accumulation. Numbers refer to the time in minutes 
before delamination. Scale bars, 10 µm. 


Supplementary Fig. 9). Moreover, this D2 process was actively induced 
by ultraviolet irradiation (Fig. 4d, f, g). To test whether D1 and/or D2 
processes were triggered by dying cells in the wild-type notum, we then 
monitored cell behaviour in nota that overexpressed an inhibitor of 
apoptosis, DIAP1 (ref. 12). This led to a small but reproducible 
decrease in the overall rate of delamination as the result of a selective 
inhibition of D2-type delamination events (Fig. 4e, f and Supplemen- 
tary Movie 6). Therefore, although a subset of cells are extruded from 
the notum as a direct consequence of cell death by means of a D2-type 
process, the vast majority of cells leave crowded regions of the tissue 
while alive, by means of a series of stochastic neighbour exchange 
events (D1). These live delaminated cells then die in a process akin to 
anoikis, their nuclei becoming visibly condensed and pyknotic 
(Supplementary Fig. 10a) before they are cleared from underneath 
the epithelium by circulating macrophages (Supplementary Fig. 10b). 
We confirmed that death is a consequence not the cause of D1-type 
delamination events by showing that midline cell delamination 
continued unabated in nota expressing RNAi targeting JNK, p53 
or Flower (Supplementary Fig. 11) and after p35 expression (data not 
shown). Finally, we verified that only a subset of cells delaminated from 
the midline during the entire course of normal development, as 
expected for a stochastic process (Supplementary Fig. 12). 
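
The distinction between the two paths can be expressed as a simple rule over a tracked cell. The sketch below labels a track as D1 when the cell loses neighbours on the way out and as D2 when its apical area shrinks with no change in neighbour number, and it checks the "final push" condition quoted in the text (fewer than five neighbours and less than about 25% of the initial area). The data layout, function names and the one-junction cut-off are assumptions for illustration, not the authors' analysis code.

```python
def classify_delamination(track, min_junctions_lost=1):
    """Label a delaminating cell as 'D1' or 'D2'.

    track: list of (neighbour_count, apical_area) tuples in time order,
    ending just before the cell leaves the tissue.
    min_junctions_lost is a hypothetical cut-off for this sketch.
    """
    initial_neighbours = track[0][0]
    fewest_neighbours = min(neighbours for neighbours, _ in track)
    junctions_lost = initial_neighbours - fewest_neighbours
    return "D1" if junctions_lost >= min_junctions_lost else "D2"

def ready_for_final_push(neighbours, area, initial_area,
                         max_neighbours=5, area_fraction=0.25):
    """True when a cell meets the thresholds described in the text."""
    return neighbours < max_neighbours and area < area_fraction * initial_area

# Hypothetical tracks: (neighbour count, apical area in square micrometres).
serial_loss = [(6, 50.0), (5, 35.0), (4, 20.0), (4, 10.0)]
area_only = [(6, 50.0), (6, 30.0), (6, 10.0)]

print(classify_delamination(serial_loss))   # junctions lost along the way
print(classify_delamination(area_only))     # area lost, neighbours unchanged
print(ready_for_final_push(4, 10.0, 50.0))
```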

Taken together, these data identify a previously unknown two-step 
process of live-cell delamination, which contributes to the refinement 
of epithelial cell packing at the end of development. We show that this 
process of crowding-induced delamination can be recapitulated by 
using a simple mechanical model of an epithelium in which cells 
compete for space that includes, first, stochastic junctional fluctuations 
that are capable of triggering neighbour exchange events, and, second, 
a final push that can remove cells with small apices from the tissue. 
Because many epithelia are both crowded and topologically active, we 
expect crowding-induced delamination to be a common process that 
buffers tissue growth to aid normal tissue homeostasis in a wide variety 
of systems. This study also reveals a potential mechanistic link between 
tissue hyperplasia and delamination, which may have important con- 
sequences for our understanding of cancer because it suggests that the 
spread of a primary epithelial tumour will be limited not by a switch 
from an epithelial to a mesenchymal state but simply by the tendency 
of delaminating cells to die. 


METHODS SUMMARY 


For this study, fluorescently marked pnr-Gal4 pupae expressing different trans- 
genes or RNAi constructs were imaged between 14h and 26h AP. Three or more 
animals were used for each experiment and, when data were quantified, signifi- 
cance was established with an unpaired t-test. For ultraviolet irradiation, pupae 
were dissected at 15h AP and the exposed notum was irradiated with a Stratalinker 
1800 crosslinker at 100 mJ cm⁻², before live imaging. For laser ablation of single 
junctions at 14h and 24h AP, the tissue was imaged with 488-nm light from an 
Ar-Kr laser every 2s and ablated with 730-nm multiphoton excitation from a 
Chameleon-XR Ti-Sapphire laser (AIM, Zeiss). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Fly stocks and genetics. The following stocks were used: pnr-GAL4 
(Bloomington:3039), E-cadherin-GFP (ubi-DE-cad-GFP), E-cadherin-GFP His-RFP 
(III) (ubi-DE-cad-GFP/CyO;His-RFP/TM6B), Act-5c-GAL4 (Bloomington:25374), 
p110 overexpression (w;UAS-myc-Dp110), DIAP1 overexpression (III) 
(if/CyO; UAS-DIAP1), Tsc1 and Tsc2 overexpression (III) (hsFLP;;UAS-Tsc1,Tsc2), 
Resille::GFP. E-cadherin-GFP was crossed with the pnr-GAL4 line 
(ubi-DE-cad-GFP;pnr-GAL4/TM6B) to drive gene or IR expression. For IR expression, males 
from each UAS-IR fly line were crossed with Ecad-GFP; pnr-GAL4 virgin females. 
Crosses between UAS-hairpin RNAi males and E-cadherin-GFP; pnr-GAL4 females 
were raised at 22 °C and shifted to 29 °C during larval stages. RNAi lines were used 
to silence the expression of the following genes: cdc25 (ID17760), fwe (ID104993), 
bsk (ID34138), pI3k (ID38985), pten (ID35731), r5 s (ID27792), warts (ID9928). All 
RNAi lines came from the Vienna Drosophila RNAi Center (VDRC) library. 
Live imaging and dissections. Live imaging was performed by cutting a window 
in the pupal case attached to a slide with double-sided tape, and placing a coverslip 
carrying a drop of injection oil over the notum, supported by coverslips at either end. 
Nota were imaged from 14h AP for 12h during development. For stained tissues, 
nota from 16h AP pupae were dissected in PBS for direct fixation in 4% formaldehyde 
for 20 min before being permeabilized with PBS containing 0.1% Triton X-100 (PBT). 
The nota were then stained for 2h at 25°C with 4',6-diamidino-2-phenylindole 
(Sigma) for nuclei and Alexa-568 phalloidin (Invitrogen) for actin. The acquisition 
was performed with a Leica SP2 laser scanning confocal microscope with 40×/1.3 
numerical aperture oil objective for both fixed and live-imaging experiments; three or 
more animals were used for each experiment and, when data were quantified, 
significance was established with an unpaired t-test in Excel. 

Ultraviolet irradiation. Pupae were dissected at 15h AP and the exposed notum 
was irradiated with a Stratalinker 1800 crosslinker at 100 mJ cm⁻², before live 
imaging. 

Laser ablation. Single junctions were ablated in pupae at 14-16 h (early pupae) 
and 24-26h (late pupae), using 730-nm multiphoton excitation from a 
Chameleon-XR Ti-Sapphire laser, under software control (AIM, Zeiss). A 
3 × 3-pixel (pixel size 0.17 µm) region of the junction was irradiated at slow scan 
speed (pixel dwell time 2.56 µs) at 25% laser power, under software control. The 
power required to ablate junctions was determined empirically each time and was 
used in each experiment (AIM, Zeiss). Ablations were triggered during time-lapse 
imaging (every 2s), using 488-nm light from an Ar-Kr laser coupled to a Zeiss 
510NLO upright confocal microscope and a 40×/1.3 numerical aperture oil 
objective. 

Image processing and analysis. The images presented were processed with 
ImageJ (http://rsb.info.nih.gov/ij/) and Adobe Photoshop CS (Adobe Systems, 
Inc.). Unless otherwise noted, images of the tissue represent maximum intensity 
z-projections, and sections were 0.9 µm apart. For laser ablation, the displacement 
of vertices was measured after a cut with the ImageJ Kymograph plugin. Statistics 
were calculated in each case on the basis of 15 junctions in 5 different pupae. 

Restoration of grasp following paralysis through 
brain-controlled stimulation of muscles 


C. Ethier, E. R. Oby, M. J. Bauman & L. E. Miller 


Patients with spinal cord injury lack the connections between brain 
and spinal cord circuits that are essential for voluntary movement. 
Clinical systems that achieve muscle contraction through func- 
tional electrical stimulation (FES) have proven to be effective in 
allowing patients with tetraplegia to regain control of hand move- 
ments and to achieve a greater measure of independence in daily 
activities. In existing clinical systems, the patient uses residual 
proximal limb movements to trigger pre-programmed stimulation 
that causes the paralysed muscles to contract, allowing use of one or 
two basic grasps. Instead, we have developed an FES system in 
primates that is controlled by recordings made from micro- 
electrodes permanently implanted in the brain. We simulated 
some of the effects of the paralysis caused by C5 or C6 spinal cord 
injury by injecting rhesus monkeys with a local anaesthetic to 
block the median and ulnar nerves at the elbow. Then, using 
recordings from approximately 100 neurons in the motor cortex, 
we predicted the intended activity of several of the paralysed 
muscles, and used these predictions to control the intensity of 
stimulation of the same muscles. This process essentially bypassed 
the spinal cord, restoring to the monkeys voluntary control of 
their paralysed muscles. This achievement is a major advance 
towards similar restoration of hand function in human patients 
through brain-controlled FES. We anticipate that in human 
patients, this neuroprosthesis would allow much more flexible 
and dexterous use of the hand than is possible with existing FES 
systems. 
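
The control loop described above (neural recordings, predicted muscle activity, stimulation intensity) can be sketched in a few lines. This is a minimal illustration that assumes a plain linear decoder and a thresholded linear mapping to stimulus intensity; the decoder form, weights, thresholds and function names are all hypothetical and are not the authors' method or parameters.

```python
def predict_muscle_activity(firing_rates, weights, bias=0.0):
    """Linear decoder: weighted sum of neuronal firing rates (assumed form)."""
    if len(firing_rates) != len(weights):
        raise ValueError("one weight is needed per recorded neuron")
    return bias + sum(w * r for w, r in zip(weights, firing_rates))

def stimulation_intensity(predicted_activity, threshold=0.1, saturation=1.0,
                          max_intensity=1.0):
    """Map predicted (normalised) muscle activity to a stimulus intensity.

    Below threshold no stimulation is delivered; above saturation the
    intensity is capped; in between it scales linearly. All limits are
    placeholders for this sketch.
    """
    if predicted_activity <= threshold:
        return 0.0
    if predicted_activity >= saturation:
        return max_intensity
    return max_intensity * (predicted_activity - threshold) / (saturation - threshold)

# Hypothetical firing rates (spikes per s) from three neurons and their weights.
rates = [10.0, 20.0, 5.0]
weights = [0.02, 0.01, 0.03]

activity = predict_muscle_activity(rates, weights)
intensity = stimulation_intensity(activity)
print(f"predicted activity = {activity:.2f}, stimulus intensity = {intensity:.2f}")
```

Capping the output is a safety-minded design choice for any stimulator: a decoder error should not translate into an unbounded stimulus.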

Worldwide, over 130,000 people each year survive spinal cord injury 
(SCI) but sustain extensive paralysis. Approximately half of these 
injuries occur above the sixth cervical vertebra, thereby affecting all 
four limbs. Most of these patients indicate that regaining the ability to 
grasp objects would provide the greatest practical benefit compared to 
regaining other lost functions. 

For this reason, considerable effort has been devoted to the development 
of FES systems to restore voluntary grasp. These systems rely 
on residual movement or muscle activity to control electrical activa- 
tion of hand muscles. Because of the complexity of the necessary 
patterns of muscle activation, current FES systems produce only one 
or two grasps using pre-programmed stimulus trains that must be 
customized for each user’. This is effective because many objects can 
be grasped adequately with only palmar or pinch grasp. However, 
normal hand use is much more complex than this. Furthermore, using 
the motion of one body part to control that of another inevitably 
increases the associated cognitive burden. If FES is to provide hand 
movements that are close to normal, a more natural control signal of 
higher dimension than that available through residual motion will be 
necessary. 

Fortunately, the rapid development of the brain machine interface 
(BMI) provides promising new means by which more flexible and 
dexterous movements might be controlled. However, despite the initial 
demonstration of strong force-related discharge in the primary motor
cortex (M1)*, virtually all existing BMIs extract only kinematic
information from the brain. This bias is ironic, as the first study to 
decode signals from simultaneously recorded neurons found that force 
was more strongly represented than movement in M1 (ref. 9). 

Only a small number of groups have pursued the possibility of using 
kinetic (force-related) information as a real-time control signal for a 
BMI, through the prediction of grasp force’*"’, joint torque’’ or muscle 
activity’®'*"*. We showed previously that despite paralysis produced 
by peripheral nerve block, monkeys could accurately modulate the 
magnitude of isometric flexion and extension wrist torque using
cortically controlled FES'*’*’”. Related results were also reported by a
group that operantly conditioned monkeys to modulate the activity 
of one or two individual neurons whose discharge directly controlled 
stimulation of individual muscles"*. 

We performed the current experiments with two monkeys trained 
to pick up weighted rubber balls and to convey them to an opening at 
the top of a dispenser (Fig. 1). After training, each monkey was 
implanted with a multi-electrode recording array in the hand area of 
M1. In a separate surgical procedure, we implanted intramuscular 
electrodes for recording and stimulation of hand and forearm muscles. 
Figure 2a shows the neural discharge recorded under normal conditions
from a representative session. Most of these 104 neuronal signals
were well-modulated during at least some portion of the task. Offline,
typically 50-75% of the neuronal signals could be discriminated as
single neurons, on the basis of the consistency of their waveform shape
and inter-spike interval histogram distribution. However, under
real-time conditions, only about one-third of the inputs were
well-discriminated single units; the remainder were signals that included
action potentials from more than one neuron. Figure 2b shows the
discharge of these neurons averaged over 229 trials, aligned to the time
of contact with the ball. The varied phasing of the different neurons is
evident.

Figure 1 | Brain-controlled FES. The monkeys’ forearm and digit flexor
muscles were temporarily paralysed by a peripheral nerve block. Recordings
from the motor cortex were used to infer the monkeys’ attempted patterns of
muscle activity and thereby control electrical stimulation that restored the
monkeys’ ability to perform a functional grasping task. The ball-grasp device
was equipped with a contact sensor and a task-success sensor that were
activated when the monkeys initially touched the ball and dropped it into the
tube, respectively.

We recorded from flexor and extensor muscles of the hand and 
forearm simultaneously with the neural recordings (Fig. 2c, d). 
There was considerable variation both in the magnitude and duration 
of electrical activity that occurred from trial to trial (Fig. 2c), and in the 
average timing and patterns of activation of the different muscles 
(Fig. 2d). 

We were able to predict electromyographic (EMG) activity with 
very high accuracy, typically from approximately 100 neural signals 
(Fig. 2c, d; red traces), using Wiener cascade decoders. These decoders 
consisted of multiple-input, linear-impulse response functions 
between the neural inputs and each muscle, followed by a static non- 
linearity. Each impulse response was composed of ten lags spanning 
500 ms. At the beginning of each week, we collected 20 min of data 
under normal conditions, and we used this to compute the coefficients 
for the decoder that were then used for the remainder of the week. 
Accuracy was represented by R², calculated using a multi-fold
cross-validation procedure described in the Supplementary Information.
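
As a rough illustration of how a multi-fold cross-validated R² can be computed, the Python sketch below splits a recording into contiguous folds, fits on all but one fold and scores the held-out fold. The actual procedure is described in the Supplementary Information and is not reproduced here; the fold count, the function names and the single-input straight-line fit (a stand-in for the real multi-input decoder) are all assumptions made for illustration.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


def fit_line(x, y):
    """Least-squares slope and intercept (hypothetical stand-in for the decoder fit)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def cross_validated_r2(x, y, n_folds=5):
    """Mean held-out R^2 over contiguous folds (n_folds is an assumed value)."""
    n = len(x)
    fold = n // n_folds
    scores = []
    for k in range(n_folds):
        lo = k * fold
        hi = n if k == n_folds - 1 else (k + 1) * fold
        # Fit on everything outside the held-out fold.
        slope, intercept = fit_line(x[:lo] + x[hi:], y[:lo] + y[hi:])
        # Score only on the held-out fold.
        predicted = [slope * xi + intercept for xi in x[lo:hi]]
        scores.append(r_squared(y[lo:hi], predicted))
    return sum(scores) / len(scores)
```

Scoring only on held-out data keeps the reported accuracy from being inflated by over-fitting, which matters when roughly 100 inputs with ten lags each are fitted at once.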

Figure 2 | Grasp-related raw data collected during normal conditions.
a, Firing rates of 104 neuronal signals recorded during a series of two grasps.
b, Ensemble average of 229 trials aligned to the time of ball contact. c, Actual
and predicted EMG during the same period as (a), with the muscles ordered by
the relative times of their onset, including extensor digitorum communis
(EDC), flexor carpi radialis (FCR), first dorsal interosseous (FDI), flexor
digitorum profundus (FDP), and extensor carpi radialis (ECR). Predicted EMG
was computed using multiple-input linear-impulse response decoders built
from data collected earlier in the session. Vertical dashed lines mark the times of
ball contact. R² values indicate prediction accuracy for the 20-min data file.
d, Ensemble averages of EMG activity, aligned to the time of initial contact. Blue
shaded regions, ±1 s.d. around the mean.

Using these real-time predictions of muscle activity, we stimulated
up to five electrodes in three different muscles (flexor carpi radialis
(FCR), medial and lateral sites in the flexor digitorum superficialis
(FDS) and flexor digitorum profundus (FDP)). By these means, we
have restored in two monkeys the ability to pick up and move objects 
despite complete paralysis of the flexor muscles in the forearm 
and hand. We began each FES experimental session by collecting data 
under normal conditions to establish baseline performance. Following 
these baseline recordings, we injected lidocaine through nerve cuffs 
implanted proximal to the elbow that blocked the median and ulnar 
nerves. After 15-20 min the nerve block was complete, as determined 
by the loss of flexor muscle EMG activity (see Supplementary 
Information and Supplementary Fig. 1), and the onset of profound 
motor deficits. We made periodic tests of nerve-block effectiveness 
throughout each session (Supplementary Fig. 2), and we used a stan- 
dardized stimulus train to evaluate the level of fatigue induced by the 
stimulation (Supplementary Fig. 2). 

Figure 3 | Grasp performance during four consecutive brain-controlled FES
trials. a, Neural data. b, Predicted EMG signals (red traces) transformed into
stimulus commands (black traces). Vertical dashed lines: go tone (Go), time of
initial ball contact (Pick up) and successful task completion (Reward).
c, Horizontal lines show average success rates for sequential 10-min blocks during
two separate experimental sessions (indicated by light and dark horizontal lines,
respectively). Each session included both FES trials (green lines) and catch trials
without stimulation (blue lines). The neuroprosthesis markedly improved the
monkeys’ ability to grasp the ball despite paralysis. d, Average success rates for
pre-block (Pre), FES and catch (Catch) trials across all sessions (100%, 76% and
10% for Monkey T; 99%, 80% and 1% for Monkey J). The total number of trials
(successful and failed) is displayed on the bars for each condition.

A series of four trials is shown in Fig. 3a, b, showing typical neural
discharge, predicted EMG and stimulus commands, as well as markers
of the monkey’s performance. Although the common digit flexors
(FDS and FDP) are normally activated nearly synchronously, FDS
activation tended to be more sustained, whereas FDP was more phasic.
The pulse widths of the stimulus trains used to activate a given muscle
were determined from the predicted EMG for that muscle using a
mapping procedure described in the Supplementary Information 
and Supplementary Fig. 3. The distribution of these pulse widths 
throughout the full range from 0 to 200 µs suggests that the monkey
was able to grade the strength of contraction continuously (Supplemen- 
tary Fig. 4). During the FES trials, the monkey grasped and moved the 
ball reliably. The movements did not differ sufficiently from normal to 
be obvious to casual observation (see Supplementary Movies 1 and 2 
for representative examples from both monkeys). On occasional ‘catch’ 
trials, we turned off the neuroprosthesis at the beginning of the trial, to 
test the ability of the monkey without FES. In the example of a catch 
trial illustrated here (note the flat stimulus trace in Fig. 3b), the monkey 
was unable to grasp the ball despite the considerable effort apparent in 
the neural discharge and predicted EMG. 

After the onset of paralysis, each experimental session consisted of a 
series of 10-min sets of trials like those in Fig. 3, in which the monkey 
attempted to complete the grasp task either with or without FES assist- 
ance. Two complete sessions for both monkeys are summarized by the 
horizontal light and dark lines in Fig. 3c. The success rate in these 
sessions using the neuroprosthesis was approximately 80% and 90% 
for the two monkeys, respectively (green lines). In contrast, the average 
catch-trial success rate was 5% for monkey T and 0% for monkey J 
(blue lines). The average number of trials per session varied substantially 
across sessions, with a mean of 272 ± 84 for monkey T, and 208 ± 112
for monkey J. Although we tried different types of balls, we did not 
systematically examine the effects of size, weight or texture on the 
monkeys’ performance. It is likely that the FES success rate would have 
been lower if balls that were substantially heavier or more slippery had 
been used. We chose to use balls that in size and weight mimicked 
objects grasped in routine human tasks (for example, eating an apple). 

Figure 3d summarizes both monkeys’ overall success rate across all 
sessions, both with the FES neuroprosthesis and during catch trials. 
Both monkeys achieved a success rate of approximately 80% using 
the neuroprosthesis, a level that was highly significantly different 
(P < 0.0001) from that of the catch-trial condition. In addition to result- 
ing in a greatly improved success rate, the FES neuroprosthesis also 
significantly increased the speed at which the monkeys completed 
successful trials (not shown; P < 0.0001 for both monkeys, two-tailed 
Mann-Whitney test). 
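
For readers unfamiliar with the test named above, the following Python sketch shows one way a two-tailed Mann-Whitney comparison of trial completion times could be computed. It uses the large-sample normal approximation without tie or continuity correction, which is a simplification chosen here and not necessarily what was used for the reported P values; the function name and any data passed to it are hypothetical.

```python
import math


def mann_whitney_u(x, y):
    """Two-tailed Mann-Whitney U test, normal approximation.

    Returns (U, p). No tie or continuity correction is applied, so this is
    only reasonable for fairly large samples with few tied values.
    """
    # U counts the pairs in which the x value is smaller than the y value;
    # tied pairs contribute one half.
    u = 0.0
    for a in x:
        for b in y:
            if a < b:
                u += 1.0
            elif a == b:
                u += 0.5
    n, m = len(x), len(y)
    mean_u = n * m / 2.0
    sd_u = math.sqrt(n * m * (n + m + 1) / 12.0)
    z = (u - mean_u) / sd_u
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-tailed p-value
    return u, p
```

Called as `mann_whitney_u(fes_times, catch_times)` with two lists of completion times in seconds, a small p indicates that the two distributions differ. A rank-based test suits this kind of data because completion times are bounded by the trial time limit and are unlikely to be normally distributed.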

To test force control more systematically we conducted a second set 
of experiments with monkey J, who was trained to control the vertical 
displacement of a cursor that moved in proportion to palmar grasp 
force. Using the neuroprosthesis, the monkey was able to squeeze a 
pneumatic tube, and to track up to three different targets ranging from 
15 to 80% of his normal maximum voluntary contraction (MVC), each 
target having a width of approximately 20% of MVC. To be successful, 
the monkey needed to maintain the target force for 0.5 s. Figure 4 
shows a short sequence of data during this target tracking task. One 
of these four trials was a catch trial. The monkey was unable to generate 
any force during the catch trial despite two attempts that are evident in 
the predicted EMG signals. 
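
The success criterion of this task (force held inside the target band for 0.5 s) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sampling rate, the function name and the representation of force as a fraction of MVC are not taken from the experiment.

```python
def held_in_target(force, lower, upper, sample_rate_hz, hold_s=0.5):
    """Return True if force stays within [lower, upper] for hold_s seconds.

    force          -- sequence of force samples, as a fraction of MVC
    lower, upper   -- target bounds, in the same units as force
    sample_rate_hz -- sampling rate of the force trace (assumed value)
    """
    needed = int(round(hold_s * sample_rate_hz))  # consecutive samples required
    run = 0
    for f in force:
        # Extend the current in-target run, or reset it on leaving the target.
        run = run + 1 if lower <= f <= upper else 0
        if run >= needed:
            return True
    return False
```

With a target spanning about 20% of MVC, for example `held_in_target(trace, 0.30, 0.50, sample_rate_hz=100)`, a catch trial in which no force is generated never accumulates an in-target run and is scored as a failure.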

We quantified this performance by measuring the mean force and 
stimulation pulse width during the target-hold periods of the initial and 
final 10 min of the session. Despite considerable FES-induced fatigue, 
the monkey remained able to achieve the required force throughout the 
session by voluntarily increasing the mean stimulus pulse width (see 
Supplementary Fig. 5). The increased pulse width reflects an increase in 
cortical activity and resultant EMG predictions. The monkey seemed to 
overcome the fatigue in a manner similar to that of normal conditions, 
increasing its effort to regulate force accurately. 

The monkey’s ability both to control a well-regulated palmar grasp
and to execute the unconstrained natural grasp is powerful
evidence of the impact that this FES neuroprosthesis could have in 
eventual clinical application. Our neuroprosthesis makes use of patterns 
of activity in M1 that reflect the patterns that occur naturally during 
grasp. By matching patterns of neuronal activity to those muscles with
which they are normally most closely correlated, we hope to maintain
the natural coupling between cortical activity and motor output.

Figure 4 | FES used to produce controlled palmar grasp force during the
palmar grasp task. a, Colour-coded neural discharge recorded during a series
of five trials. The rectangles indicate times of target appearance and
disappearance, as well as their upper and lower force bounds. The white trace is
the force generated by the monkey, resulting from stimulation of FDS and FDP
(Stim commands). b, There were four successful trials with FES (green targets)
and one unsuccessful catch trial (blue target). During the catch trial, the
monkey made two unsuccessful attempts to squeeze the tube, as seen in the
EMG predictions (also evident in the neural discharge (a)).

It is important to note that this process in no way limits the ability of 
the brain to adapt further, to compensate for inaccuracies in the 
decoded signals or the stimulus-evoked contractions. Even with 
adaptation, it is difficult to imagine how a small number of individually
conditioned, randomly selected neurons could yield an adequate level 
of control without the type of pre-programming that is necessary with 
existing FES systems. Indeed, there is no evidence that it is possible to 
learn to associate the simultaneous activity of two, three or more 
neurons with independent patterns of muscle activity. Even if possible, 
the cognitive load associated with this effort would presumably be 
rather high, whereas the reliability of a neuroprosthesis relying on a 
small number of conditioned neurons would be quite low. 

Our model of paralysis avoided many of the complications of actual 
spinal cord injury, including muscle denervation and spasticity’?”°. 
Furthermore, it was limited to the forearm and digit flexors. Patients 
with C5 and C6 spinal cord injury retain voluntary control of proximal 
arm muscles while losing full control of the more distal limb. Many 
patients retain or regain some level of voluntary wrist extension’’. As 
we did not paralyse the monkeys’ extensor muscles in this experiment, 
it is important to recognize the good coordination between the remaining 
natural muscle control and that achieved through the neuroprosthesis. 
We routinely obtained extensor EMG predictions that were in fact 
slightly more accurate than those of the flexors. In future experiments, 
we intend to expand our control to these muscles. 

This technology may offer even greater advantages to patients with 
more severe injuries, who have a greater need for replaced function but 
possess even fewer available sources of control”. In addition to the 
ability to predict the reach-related activity of distal limb muscles
considered in this study, we have previously shown the same ability in
relation to proximal limb muscles, suggesting the possibility of extend- 
ing this control to these muscles’’. As well as offering patients greater 
independence, FES is also established as an effective means for exercis- 
ing the muscles of paralysed patients, bringing a range of health benefits: 
stronger muscles and bones, improved metabolism, cardiorespiratory 
health and reduced propensity to pressure sores****. It may be that 
drawing on a conscious process to restore natural movement will bring
the additional benefit of improved psychological health”.

METHODS SUMMARY

Experimental subjects and task. Two monkeys were trained to perform a ball- 
grasp task (Fig. 1) and one of the monkeys was also trained to perform a 
controlled-force palmar grasp task. The monkeys were allowed 5 s to grasp one 
of several balls (ranging in size from 25-40 mm in diameter and 55-130 g) and 
place it into the top of a dispenser tube. The palmar grasp task required the 
monkey to squeeze a pneumatic tube that controlled movement of a cursor. 
Force targets were chosen from a set of two or three non-overlapping levels. All 
procedures were approved by the Institutional Animal Care and Use Committee of 
Northwestern University, Illinois, USA. 

EMG prediction. Inputs consisted of roughly 100 single and multi-unit signals 
from a 100-electrode array (Blackrock Microsystems) implanted within the hand 
area of M1. Decoders consisted of multiple-input impulse response functions 
between the neural inputs and each muscle, subsequently transformed by a 
second-order static nonlinearity to reduce the baseline noise in the predictions 
and to increase the gain near the EMG peaks'*'®. We computed decoders at the 
beginning of each week, which were used in daily sessions for the remainder of the 
week. We conducted 20 sessions with 7 decoders across 7 weeks for monkey T and 
27 sessions with 6 decoders across 11 weeks with monkey J. 
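
The two stages of such a decoder can be sketched as follows: a linear stage that sums lagged, weighted firing rates from all inputs, followed by a second-order polynomial applied to the result. This Python sketch covers prediction only; the filter weights and polynomial coefficients are assumed to have been fitted beforehand, which is not shown. The function name, argument layout and the 50-ms bin width implied by ten lags spanning 500 ms are illustrative assumptions.

```python
def wiener_cascade_predict(rates, filters, poly, n_lags=10):
    """Predict one muscle's EMG from binned firing rates.

    rates   -- rates[t][i]: firing rate of neural input i in time bin t
    filters -- filters[i][k]: weight of input i at lag k (k = 0 is the current bin)
    poly    -- (c0, c1, c2): coefficients of the second-order static nonlinearity
    Returns one prediction per bin, starting at bin n_lags - 1.
    """
    c0, c1, c2 = poly
    n_inputs = len(filters)
    predictions = []
    for t in range(n_lags - 1, len(rates)):
        # Linear stage: sum over inputs and lags (a multiple-input FIR filter).
        linear = 0.0
        for i in range(n_inputs):
            for k in range(n_lags):
                linear += filters[i][k] * rates[t - k][i]
        # Static nonlinearity applied to the output of the linear stage.
        predictions.append(c0 + c1 * linear + c2 * linear ** 2)
    return predictions
```

A positive quadratic coefficient flattens small outputs relative to large ones, which is one way a polynomial can reduce baseline noise and raise the gain near EMG peaks as described above.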

Stimulation. All muscles were stimulated at a single, fixed rate of either 25 or 30 
Hz to achieve nearly fused contractions. The EMG predictions were transformed 
into stimulus pulse widths by mapping the predicted EMG noise floor to the 
stimulus force threshold, and the maximum predicted EMG to the maximum 
pulse width (200 µs; see Supplementary Fig. 3). The current, typically 2-8 mA,
was chosen independently for each electrode, to yield forces of approximately 50%
of the maximal evocable force at 200-µs pulse width.
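
The mapping from predicted EMG to stimulus pulse width can be illustrated with a short Python sketch. The two anchor points (noise floor to force threshold, maximum predicted EMG to 200 µs) come from the text; the linear interpolation between them, the zero output below the noise floor, and all names are assumptions, since the actual procedure is given in Supplementary Fig. 3.

```python
def emg_to_pulse_width(emg, noise_floor, emg_max, pw_threshold_us, pw_max_us=200.0):
    """Map a predicted EMG value to a stimulus pulse width in microseconds.

    The predicted-EMG noise floor maps to the pulse width at which the muscle
    just begins to produce force (pw_threshold_us), and the maximum predicted
    EMG maps to pw_max_us. Linear interpolation in between is an assumption.
    """
    if emg < noise_floor:
        return 0.0  # below the noise floor: no stimulation (assumed)
    # Fraction of the usable EMG range, capped at 1 so the output never
    # exceeds the maximum pulse width.
    fraction = min((emg - noise_floor) / (emg_max - noise_floor), 1.0)
    return pw_threshold_us + fraction * (pw_max_us - pw_threshold_us)
```

Capping the output at the maximum pulse width keeps an unusually large prediction from driving the stimulator beyond the range for which each electrode's current was chosen.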
Cell attachment protein VP8* of a human rotavirus
specifically interacts with A-type histo-blood
group antigen

Liya Hu, Sue E. Crawford, Rita Czako, Nicolas W. Cortes-Penfield, David F. Smith, Jacques Le Pendu, Mary K. Estes
& B. V. Venkataram Prasad


As with many other viruses, the initial cell attachment of rotaviruses, 
which are the major causative agent of infantile gastroenteritis, is 
mediated by interactions with specific cellular glycans’ *. The distally 
located VP8* domain of the rotavirus spike protein VP4 (ref. 5) 
mediates such interactions. The existing paradigm is that 
‘sialidase-sensitive’ animal rotavirus strains bind to glycans with 
terminal sialic acid (Sia), whereas ‘sialidase-insensitive’ human 
rotavirus strains bind to glycans with internal Sia such as GM1 
(ref. 3). Although the involvement of Sia in the animal strains is 
firmly supported by crystallographic studies'**”, it is not yet known 
how VP8* of human rotaviruses interacts with Sia and whether their 
cell attachment necessarily involves sialoglycans. Here we show that 
VP8* of a human rotavirus strain specifically recognizes A-type 
histo-blood group antigen (HBGA) using a glycan array screen com- 
prised of 511 glycans, and that virus infectivity in HT-29 cells is 
abrogated by anti-A-type antibodies as well as significantly enhanced 
in Chinese hamster ovary cells genetically modified to express 
the A-type HBGA, providing a novel paradigm for initial cell 
attachment of human rotavirus. HBGAs are genetically determined 
glycoconjugates present in mucosal secretions, epithelia and on 
red blood cells®, and are recognized as susceptibility and cell attachment
factors for gastric pathogens like Helicobacter pylori and
noroviruses’’. Our crystallographic studies show that the A-type 
HBGA binds to the human rotavirus VP8* at the same location as 
the Sia in the VP8* of animal rotavirus, and suggest how subtle 
changes within the same structural framework allow for such 
receptor switching. These results raise the possibility that host sus- 
ceptibility to specific human rotavirus strains and pathogenesis are 
influenced by genetically controlled expression of different HBGAs 
among the world’s population. 

Rotaviruses are classified on the basis of the neutralization specificity 
of the outer capsid proteins VP7 and VP4 into G (VP7) and P (VP4) 
genotypes following a dual nomenclature system similar to influenza 
viruses"’. The crystallographic structures of VP8* from two sialidase- 
insensitive human strains, representing P[8] (Wa)’ and P[4] (DS1)”, 
from two sialidase-sensitive animal strains, representing P[3] 
(RRV)°*’and P[7] (CRW-8)', and the structures of two animal VP8* 
with bound Sia’*"’ have been previously reported. NMR, cell-binding 
and neutralization assays showed that the sialidase-insensitive P[8] Wa 
strain binds to gangliosides such as GM1 using internal Sia’. These 
studies suggested that whereas the sialidase-sensitive strains recognize 
glycans with terminal Sia such as GD1a, the sialidase-insensitive 
rotavirus strains bind to gangliosides such as GM1 with an internal 
Sia moiety, and gave rise to the notion that Sia is the key determinant 
for host-cell recognition in rotaviruses. Our goal was to determine 
whether all sialidase-insensitive human rotavirus genotypes recognize 
gangliosides with an internal Sia moiety for initial cell attachment, or
whether they recognize different glycans in a genotype-dependent
manner. VP8* (amino acids 64-224), cloned from a human rotavirus 
strain (HAL1166) first isolated from a child in Finland"’, was expressed 
in Escherichia coli, purified to homogeneity and crystallized for 
structural analysis. The sialidase-insensitive HAL1166 strain
phylogenetically and serologically belongs to the G8P[14] genotype”.
Although not as prevalent as the P[4] and P[8] genotypes, the P[14] 
genotypes are being increasingly documented by global rotavirus 
surveillance”, and P[14] human rotaviruses are thought to be able 
to jump from animal to human hosts’’. 

The structure of the HAL1166 VP8* determined to 1.5 Å resolution
shows the characteristic galectin-like fold with two twisted β-sheets
separated by a shallow cleft as observed in the VP8* structures from 
other rotavirus strains (Fig. 1a). The structure of P[14] VP8* super- 
imposes well with all of the VP8* structures previously determined. 
One significant difference between these structures is in the width of 
the cleft separating the two twisted β-barrel sheets (Fig. 1b). In P[14]
VP8*, it is narrower than the cleft in the VP8* of the other two human 
strains, similar to that in the VP8* of the animal strains. In the animal 
VP8* structures, Sia binds near the cleft (Fig. 1c). Although the cleft in 
the P[14] VP8* structure is of similar dimensions as in the animal 
VP8* structures, the structural features in this region of P[14] VP8* are
not compatible with Sia binding. In addition to changes in the amino 
acid residues (Fig. 1d) and side-chain orientations, the positioning of 
the amino acid residues is slightly shifted in this region because of an 
insertion (amino acid 187). The side-chain of Y188 is oriented in such 
a way that it would cause steric hindrance to Sia binding (Fig. 1c). 
Furthermore, the P[14] VP8* structure with a narrower cleft and 
several amino acid changes (Fig. 1d) is not compatible with binding 
of GM1 as suggested, based on computer modelling, for VP8* of the 
other human strains with a wider cleft’. 

These observations prompted us to undertake a high throughput 
screening of a glycan array comprised of 511 different glycans, includ- 
ing several glycans with terminal or internal Sia. Such screening, which 
has been used to identify cellular glycans for a variety of pathogens 
including bacterial toxins’, influenza viruses’? and polyomavirus”, 
unambiguously showed specific binding to glycans with a terminal 
GalNAcα1-3(Fucα1-2)Galβ1-4GlcNAc sequence (Fuc, fucose; Gal,
galactose; Glc, glucose; NAc, N-acetyl), which is a characteristic of
A-type HBGA (Supplementary Table 1, Supplementary Figs 1 and 2).
None of the sialylated glycans, with either internal or terminal Sia, 
showed significant binding (Supplementary Table 1, blue). 

To understand the structural interactions between P[14] VP8* and
A-type HBGA, we co-crystallized VP8* with tri- and tetrasaccharides
that correspond to the terminal structure in the A-type HBGA. The
structure of the complex, determined to a similar resolution of 1.5 Å as
the unliganded structure, clearly showed density for the bound ligand
(Fig. 2a). The HBGA binds near the cleft region at the same location as
the Sia in the VP8* structures of the animal rotavirus strains (Fig. 2b
and Supplementary Fig. 3). The binding of HBGA, which does not
cause any conformational changes in the VP8*, involves a network of
both direct and solvent-mediated hydrogen bond interactions, in
addition to several stabilizing hydrophobic contacts (Supplementary
Fig. 4). The terminal GalNAc and Gal of the HBGA participate in all of
the interactions (Fig. 2c). The GalNAc participates in direct
hydrogen-bonding interactions involving the side chains of R101 and T191,
and hydrophobic interactions involving L190 and T191, whereas Gal
participates in hydrogen-bonding interactions with the main chain
carbonyl groups of Y189 and S187, and hydrophobic interactions with
Y189 and Y188. The proximal sugar moieties project out from the
surface of VP8* without making any direct contacts with VP8*
(Fig. 2 and Supplementary Fig. 3).

The structure of P[14] VP8* with HBGA shows how subtle amino
acid changes result in altered ligand specificity. The only positionally
conserved common residue between P[14] and animal VP8* that
participates in the ligand interactions is R101. Although the position
matches well, its side-chain orientation differs significantly between
the two structures (Fig. 1c). Without this change, the side-chain of
R101 in the P[14] VP8* structure would cause steric hindrance to the
GalNAc moiety of the HBGA. Most other residues that participate in
the ligand interactions are upstream of position 187 in the VP8*
sequence, where an insertion occurs in the P[14] VP8*. This insertion
along with sequence variation alters the configuration of the binding
site in the P[14] VP8* to allow specific interaction with HBGA.

Figure 1 | VP8* structure of HAL1166 P[14] human rotavirus strain and
structural comparison with other VP8* structures. a, Cartoon
representation of the P[14] VP8* structure showing a galectin-like fold with the
characteristic two twisted β-sheets (in blue and green) separated by a cleft. The
Sia binding site as observed in the animal VP8* is shown by a box. The β-ribbon
and the carboxy-terminal α-helix of the structure are shown in red. The amino
and carboxy termini are denoted. b, Structural alignment of P[14] VP8*
structure (blue) with VP8* structure of sialidase-sensitive animal rotavirus
RRV strain (green, PDB ID 1KQR) and VP8* of sialidase-insensitive human
rotavirus strain Wa (orange, PDB ID 2DWR) shown in the same orientation as
in a. The width of the cleft in the P[14] VP8* is narrower (red arrow) than in the
Wa VP8* structure (black arrow). c, P[14] VP8* shows changes in the amino
acid composition and side-chain orientations in the region corresponding to
Sia binding site of RRV VP8*. The amino acid residues interacting with Sia in
the RRV VP8* structure are shown as green sticks, and bound Sia is shown as
yellow sticks with oxygen atoms in red. The residues in this region of the P[14]
VP8* structure are shown as blue sticks. The residue numbering corresponds to
HAL1166 VP8*. The key amino acid changes in this region between P[14] and
RRV VP8* are indicated by black arrows. Noticeable is how Y188 in the P[14]
VP8* structure causes steric hindrance if Sia were to bind in this region. Also
noticeable is the change in side-chain orientation of the conserved R101.
d, Alignment of HAL1166 VP8* with other VP8* sequences from
sialidase-sensitive animal (RRV, CRW-8) strains, and sialidase-insensitive human
rotavirus (Wa and DS1) strains. The residues that interact with Sia in the
animal VP8* as shown in c are indicated by red stars on the top of the sequence.
Highly conserved (>80%), and moderately conserved (>60%) regions are
coloured in dark and lighter grey, respectively. The root mean square deviation
(r.m.s.d.) of the matching Cα atoms between the P[14] VP8* and other VP8*
structures along with percentage of sequence identity are shown in the table on
the right.
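
For reference, the r.m.s.d. between matched Cα atoms of two superimposed structures is the square root of the mean squared inter-atomic distance. The Python sketch below assumes the superposition and the atom matching have already been done by a structural alignment program, neither of which is shown; the function name and input layout are illustrative.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equal-length lists of (x, y, z).

    Assumes the two structures are already superimposed and that atom k in
    coords_a is matched to atom k in coords_b.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared distance between the two matched atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))
```

Because each deviation is squared before averaging, a few badly matching residues, such as those around an insertion, can dominate the value even when the rest of the fold superimposes closely.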


Insertion at position 187 causes a localized change which makes the 
side-chain of Y188 clash with Sia when placed in the structure of P[14] 
VP8* (Fig. 1c). Following this residue, the polypeptide chain reverts 
back to the same course as in the animal VP8* structures such that 
Y189 and T191, which interact with HBGA, are now positionally 
equivalent to Y188 and S190 which interact with Sia in the animal
VP8* (Figs 1c and 2c). 
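
The numbering offset described in this paragraph can be made explicit with a small Python sketch. It encodes one reading of the text: a single-residue insertion at position 187 of P[14] VP8*, after which each residue corresponds to the animal VP8* residue numbered one lower. The function name is hypothetical, and the mapping is a simplification that ignores the local backbone shift next to the insertion.

```python
INSERTION_POS = 187  # position of the inserted residue in P[14] VP8* (from the text)


def p14_to_animal(position):
    """Map a P[14] VP8* residue number to the equivalent animal VP8* number.

    Residues before the insertion keep their number, the inserted residue has
    no counterpart (None), and residues after it shift down by one.
    """
    if position < INSERTION_POS:
        return position
    if position == INSERTION_POS:
        return None
    return position - 1
```

Under this reading, `p14_to_animal(189)` and `p14_to_animal(191)` give 188 and 190, matching the Y189/Y188 and T191/S190 equivalences stated above, while R101 keeps its number in both.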

The remarkable overlap of the HBGA binding site in the P[14] VP8* 
with that of the Sia in the animal VP8* structure strongly suggests that 
A-type HBGA is a cell attachment factor for P[14] rotavirus strains. To 
examine the biologic relevance of HBGA binding to P[14] VP8*, virus 
infectivity assays were performed. Using intestinal HT29 cells isolated 
from a type A individual, dose-dependent abrogation of HAL1166 
infectivity was observed, with a greater than 75% reduction at the 
highest concentration of anti-A-type HBGA antibody compared to 
an isotype control antibody (Fig. 3a, b). In contrast, this antibody 
did not inhibit the sialidase-sensitive SA11 rotavirus strain (Fig. 3a). 
To ascertain further the specificity of the HAL1166 to A-type HBGA, 
infectivity assays were performed using parental Chinese hamster 
ovary (CHO) cells, which do not express any HBGA, and genetically- 
engineered CHO cells expressing either A- or H-type HBGA. CHO 
cells expressing type A HBGA showed a large increase in infectivity 
with HAL1166, but not SA11, compared to parental CHO cells or those 
expressing type H alone (Fig. 3c). Similarly, low infectivity was 
observed in Caco-2 cells, isolated from a blood type O individual, 
compared to HT29 cells that express type A HBGA (data not shown). 
Specificity to A-type HBGA was further confirmed by performing 


©2012 Macmillan Publishers Limited. All rights reserved 


ik o «Gal 
: AcHN 0: ss i 
GalNac “° ae 1 


Figure 2 | Structural analysis of P[14] VP8*-A-type HBGA interactions. 
a, The chemical structure of the A-type trisaccharide (above) and simulated 
annealing omit difference map (below), contoured at 3a level, showing the 
binding of A-type trisaccharide to P[14] VP8*. Bound A-type trisaccharide 
(GalNAc, N-acetylgalactosamine; Gal, galactose; Fuc, fucose) is shown in a ball- 
and-stick representation (yellow) inside the map with its carbon atoms 
numbered following the standard convention. The nitrogen and the oxygen 
atoms in the trisaccharide are coloured in blue and red, respectively. b, Surface 
representation of the P[14] VP8* structure (grey) with the bound 
A-trisaccharide shown in stick representation (with the same colour scheme as 


haemagglutination assays using P[14] glutathione-S-transferase 
(GST)-VP8*. Type A blood cells, but not type O or B, were haemag- 
glutinated by soluble VP8* (Supplementary Fig. 5). 

This is the first study describing the structural interactions between 
a human VP8* and a cellular glycan. Importantly, it shows that bind- 
ing of sialylated glycans is not obligatory among sialidase-insensitive 
human rotavirus strains. Our finding that P[14] VP8* specifically 
recognizes HBGA raises important questions such as whether other 
Figure 3 | HAL1166 rotavirus specifically recognizes A-type HBGA.

a, Dose-dependent inhibition of HAL1166 infection in HT29 cells by anti-A- 
type antibody (Ab, white bars for HAL1166, and black bars for SA11). Isotype 
control IgG3 did not inhibit HAL1166 infectivity (grey bars). Error bars (also in 
Fig. 3c) represent standard deviation and the P values were determined by 
Student’s t-test, n = 3. *All concentrations of anti-A-type antibody reduced 
infectivity compared to control with P < 0.05. b, Representative 
immunofluorescence microscopy images of HT29 cells infected with HAL1166 
in a). The acetamido group of GalNAc inserts into a well-defined pocket in the
VP8* structure. The amino acid residues in the P[14] VP8* which participate in 
hydrogen bond and hydrophobic interactions with the trisaccharide are 
indicated in blue. c, Network of hydrogen bond interactions (dashed lines) 
between the VP8* residues (light blue) and A-type trisaccharide (coloured as in 
a). Participating water molecules are shown as small spheres (cyan). More 
detailed interactions between VP8* and the ligand are given in Supplementary 
Fig. 4. The terminal two saccharide moieties of the A-type tetrasaccharide 
(GalNAcα1-3(Fucα1-2)Galβ1-4GlcNAc) also show similar interactions with
the VP8* (Supplementary Fig. 3). 


human rotavirus strains interact with similar or other HBGAs in a 
serotype-dependent manner like in human noroviruses’®, and whether 
genetically controlled differential expression of HBGAs among the world’s
population plays a role in susceptibility to human rotaviruses. In a 
recently published paper, G8P[14] rotavirus was identified in the stool 
samples from two adults with diarrhoea, who lived in the same geo- 
graphical area in Denmark*’. The blood type of one of these patients 
and of another patient infected with a G6P[14] virus was type A (B. 
rotavirus in the presence of 50 µg ml⁻¹ of IgG3 (top) and anti-A-type antibody
(bottom). c, Infectivity of SA11 P[1], HAL1166 P[14] and K8 P[9] rotavirus
strains in the parental CHO cells (H-/A-), the single transfectant with the Fut2
enzyme (H+/A-), and the double transfectant with both Fut2 and A-type
histo-blood group glycosyltransferase (H+/A+). The fold difference in infectivity was
determined compared to parental cells. For HAL1166 and K8 human
rotaviruses, the increase in infectivity in CHO (H+/A+) cells was compared to
parental CHO and CHO (H+/A-) cells; the P values were <0.01.
Böttiger, personal communication). Although this is a small sample
size, these findings warrant further epidemiological studies to deter- 
mine whether HBGA is a susceptibility factor for rotaviruses. Based on 
sequence comparisons, our prediction that VP8* of the K8 human 
rotavirus strain (P[9] genotype) would also recognize A-type HBGA 
(Supplementary Fig. 6) is firmly supported by infectivity assays with 
parental and derivative CHO cells (Fig. 3c). 

The double-stranded RNA rotaviruses, accounting for approxi- 
mately 500,000 child deaths annually worldwide”, have enormous 
genetic and strain diversity. In addition to point mutations and gene 
rearrangements, genetic reassortment between co-circulating strains, 
similar to that in influenza viruses, contributes to the expanding diversity of
rotaviruses****. Current evidence indicates that many of the human 
rotavirus strains, including the P[14] HAL1166 strain’’, originated 
from animal reservoirs through reassortment and inter-species
transmission™*. Although effective vaccines are currently available,
whether they will remain effective with such expanding virus diversity 
is an open question””®. The discovery that a human rotavirus strain with
host-switching capabilities binds to a non-sialylated but novel glycan
receptor opens new approaches to better understand the molecular
basis of critical human rotavirus–host interactions, which probably
influence host specificity, cell specificity, pathogenesis and virus
evolution. 


METHODS SUMMARY 


Expression, purification and crystallization of P[14] VP8* and its complex with
A-type oligosaccharides, structure determination using the RRV VP8* structure
(PDB ID: 1KQR) as a molecular replacement model, and refinement were carried
out as described in the Methods section. Diffraction data were collected at Baylor
College of Medicine using a Rigaku FR-E+ SuperBright rotating anode. See
Supplementary Table 2 for data collection and refinement statistics. The
carbohydrate-binding specificity of P[14] VP8* was investigated using glycan
array v4.2 with 511 glycans in replicates of six (Consortium for Functional
Glycomics Protein-Glycan Interaction Core (H); http://www.functionalglycomics.com).
GST-tagged VP8* bound on the glycan array was detected using a
fluorescent-labelled anti-GST monoclonal antibody. Infectivity assays were
performed as previously described*”’.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Protein expression and purification. VP8* (amino acids 64-224) of HAL1166
rotavirus strain (P[14] genotype) was cloned into an expression vector pGEX-2T
(GE Healthcare) with an N-terminal GST tag and a thrombin cleavage site. The
recombinant GST-VP8* was expressed in E. coli BL21 (DE3) (Novagen) and
purified by Glutathione Sepharose 4 Fast Flow (GE Healthcare). The GST tag
was cleaved by using thrombin before rebinding the protein mixtures onto a
Glutathione Sepharose column to remove the GST, leaving Gly-Ser at the N
terminus. The VP8* was then filtered and further purified by size-exclusion
chromatography on a Superdex-75 (GE Healthcare) column with 10 mM Tris, pH 7.4,
100 mM NaCl, 1 mM dithiothreitol (DTT). The concentration of the purified
VP8* was determined by measuring absorbance at 280 nm and using an
absorption coefficient of 43,010 M⁻¹ cm⁻¹ calculated using Vector NTI 11 software
(Invitrogen).
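
The concentration estimate described above follows the Beer-Lambert relation, c = A / (ε·l). The short Python sketch below illustrates that arithmetic using the absorption coefficient quoted in the text. The 1 cm path length, the example absorbance reading and the function name are assumptions made for illustration and are not taken from the paper.

```python
def concentration_from_a280(absorbance, epsilon_per_m_cm=43010.0, path_cm=1.0):
    """Return molar concentration (mol/L) from absorbance at 280 nm.

    Beer-Lambert law: A = epsilon * c * l, so c = A / (epsilon * l).
    epsilon_per_m_cm defaults to the coefficient quoted in the Methods;
    path_cm = 1.0 is an assumed cuvette path length.
    """
    if epsilon_per_m_cm <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return absorbance / (epsilon_per_m_cm * path_cm)


if __name__ == "__main__":
    a280 = 0.50  # hypothetical reading, for illustration only
    molar = concentration_from_a280(a280)
    print(f"A280 = {a280:.2f} corresponds to {molar * 1e6:.2f} micromolar")
```

Converting the molar value to mg ml⁻¹ would additionally require the molecular mass of the construct, which is not stated in this section.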

Crystallization. Crystallization conditions for P[14] VP8* (13.5 mg ml⁻¹) were
screened by hanging-drop vapour diffusion using the Mosquito crystallization
robot (TTP LabTech) and visualized using Rock Imager (Formulatrix) at 20 °C.
The crystals from one of the conditions (30% PEG 1500, sodium acetate trihydrate,
pH 4.5) were harvested with the screen condition containing 18% glycerol. To
obtain crystals of the VP8*-HBGA complex, VP8* was co-crystallized with A-type
trisaccharide or tetrasaccharide (purchased from Dextra labs), with the ligand in
1:52 or 1:46 molar excess, under conditions similar to those used for the
unliganded P[14] VP8*.
Data collection and processing. Diffraction data for both unliganded and
liganded VP8* crystals were collected at Baylor College of Medicine using a
Rigaku FR-E+ SuperBright rotating anode. These data were processed with
d*TREK²⁸ or iMOSFLM as implemented in the CCP4 suite²⁹. Space group was
confirmed using POINTLESS³⁰. The unliganded and liganded VP8* structures in
the P2₁ space group, with one molecule in the asymmetric unit, were determined
at ~1.5 Å resolution. For initial phasing, the RRV VP8* structure (PDB ID
1KQR) was used as a search model for molecular replacement using Phaser³¹.
Following automated model building and solvent addition using ARP/wARP³²,
the structure was refined using PHENIX³³. The oligosaccharide moieties of the
HBGAs were generated using the SWEET2 package³⁴ of the Glycosciences.de
server (http://www.glycosciences.de) and modelled into the electron density using
COOT³⁵ and validated by computing simulated annealing omit maps using
PHENIX³³. The stereochemistry of the oligosaccharides including the allowed
conformational angles was checked using the CARP³⁶ package in the
Glycosciences.de server. Data collection and refinement statistics are given in
Supplementary Table 2. Ligand interactions were analysed using COOT and
LIGPLOT³⁷ with donor to acceptor distances between 2.6 Å and 3.2 Å for
hydrogen-bonding interactions, and C-C distances between 3.4 Å and 4.5 Å for
hydrophobic interactions. The structural alignments and calculations of r.m.s.d.
were carried out using PyMOL (http://www.pymol.org/). Figures were prepared
using Chimera³⁸.
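
The distance cut-offs quoted above for ligand interactions can be expressed as a simple classification rule. The Python sketch below is only an illustration of those cut-offs. It treats any N or O atom as a potential donor or acceptor, which is a simplifying assumption, and it is not a reimplementation of the criteria applied by COOT or LIGPLOT. The function names and example coordinates are hypothetical.

```python
import math

# Distance windows (in angstroms) taken from the Methods text.
HBOND_RANGE = (2.6, 3.2)        # donor-to-acceptor distance
HYDROPHOBIC_RANGE = (3.4, 4.5)  # carbon-to-carbon distance

POLAR_ELEMENTS = {"N", "O"}     # assumption: N/O stand in for donors/acceptors


def atom_distance(xyz_a, xyz_b):
    """Euclidean distance between two 3D coordinates."""
    return math.dist(xyz_a, xyz_b)


def classify_contact(element_a, element_b, distance):
    """Label a protein-ligand atom pair using only the quoted cut-offs."""
    if (element_a in POLAR_ELEMENTS and element_b in POLAR_ELEMENTS
            and HBOND_RANGE[0] <= distance <= HBOND_RANGE[1]):
        return "hydrogen bond"
    if (element_a == "C" and element_b == "C"
            and HYDROPHOBIC_RANGE[0] <= distance <= HYDROPHOBIC_RANGE[1]):
        return "hydrophobic"
    return None


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only.
    d = atom_distance((0.0, 0.0, 0.0), (2.9, 0.0, 0.0))
    print(classify_contact("O", "N", d))
```

A real analysis would also take hydrogen-bond geometry (angles) and donor or acceptor chemistry into account, which this sketch deliberately omits.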

Glycan array screening. The carbohydrate-binding specificity of HAL1166 VP8*
was investigated on glycan array v4.2 comprised of 511 glycans (Consortium
for Functional Glycomics, Protein-Glycan Interaction Core-H)
(http://www.functionalglycomics.org). Recombinant GST-tagged VP8* at decreasing
concentrations in binding buffer (20 mM Tris-HCl pH 7.4, 150 mM sodium chloride,
2 mM calcium chloride, 2 mM magnesium chloride, 0.05% Tween 20, 1% BSA)
was applied to separate glycan arrays, and bound protein was detected using a
fluorescent-labelled anti-GST monoclonal antibody. A summary of the glycan array
results is given in Supplementary Table 1. Concentration-dependent binding at
20 µg ml⁻¹ and 2 µg ml⁻¹ is shown in Supplementary Fig. 1a, b, where the glycans
are ranked according to their relative binding strengths (Supplementary Fig. 1c) as
described previously³⁹.

Inhibition and infectivity assays. Inhibition assays were performed on HT29
(human intestinal epithelial) cells following previously described protocols⁴⁰. The
monoclonal antibody (MAb) against blood group A antigen (BG-2) was
purchased from Covance. The isotype control antibody (MG3-35) was purchased
from Abcam. HAL1166 virus was grown as previously described⁴¹. Confluent
HT29 cell monolayers were grown on 96-well plates. Increasing concentrations
of antibodies were allowed to bind to the cells at 4 °C for 1 h before 400 fluorescent
focus units (FFU) of virus were added per well. Recombinant VP8* protein
competition was determined by treating the cells with 62.5 or 31 µg ml⁻¹ VP8* at 4 °C
for 30 min before virus inoculation as earlier. After allowing virus attachment to
HT29 cells for 1 h on ice, the inoculum was removed, and the cells were washed
with cold DMEM and incubated for 16 h at 37 °C in 95% (v/v) air with 5% (v/v)
CO₂. Virus titres in methanol-fixed cell monolayers were determined by staining
cells with rabbit anti-rotavirus antibody and Alexa 488-labelled donkey anti-rabbit
secondary antibody (Invitrogen). Virus infectivity was expressed as the percentage
of focus-forming units in control wells incubated with the same concentrations of
bovine serum albumin. Data are given as the mean of three replicates, and the bar
indicates the standard deviation. Data were analysed by Student's t-test (two-tailed
test).
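
The normalization described above (infectivity as a percentage of the focus-forming units in control wells, reported as mean and standard deviation of three replicates) can be sketched as follows. This is a minimal illustration in Python. The counts are made-up numbers and the function names are hypothetical, not the authors' analysis code.

```python
from statistics import mean, stdev


def percent_of_control(treated_ffu, control_ffu):
    """Express each treated-well count as a percentage of the mean control count."""
    control_mean = mean(control_ffu)
    if control_mean <= 0:
        raise ValueError("control wells must contain foci")
    return [100.0 * count / control_mean for count in treated_ffu]


def summarize(values):
    """Return (mean, sample standard deviation) for replicate values."""
    return mean(values), stdev(values)


if __name__ == "__main__":
    # Hypothetical focus counts for three replicate wells each.
    control = [410, 395, 402]
    antibody_treated = [98, 110, 91]
    pct = percent_of_control(antibody_treated, control)
    avg, sd = summarize(pct)
    print(f"infectivity: {avg:.1f}% of control (s.d. {sd:.1f}, n = {len(pct)})")
```

The two-tailed Student's t-test mentioned in the text would then be applied to the replicate values; it is left out here to keep the sketch dependency-free.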

For infectivity assays, SA11, HAL1166 or K8 viruses were serially diluted and
incubated on confluent monolayers in 96-well plates of HT29, Caco-2 and CHO
cells (parental cells or cells expressing type H HBGA, or type H and type A HBGA
prepared as previously described⁴²). Following attachment for 1 h at 37 °C, the
cells were washed, incubated for 16 h and processed as described above for
immunofluorescence.
Haemagglutination assay. Pooled human red blood cells (RBCs) were purchased from
Immucor. The RBCs were packed via centrifugation at 500g for 10 min and 0.5%
suspensions of each RBC type were prepared in 0.85% saline (pH 6.2). GST (negative
control) or GST-tagged VP8* were serially diluted via doubling dilution in PBS
(0.01 M sodium phosphate, 0.15 M NaCl, pH 5.5; passed through a 0.2-µm
pore-size filter) on 96-well V-bottom plates (Nunc) in triplicate. The haemagglutination
activity of the VP8 dimers was tested by mixing 50 µl of the prep (starting dilution
of 10 nM) with an equal volume of the RBC suspensions. Recombinant norovirus
virus-like particles (Norwalk virus, genogroup GI.1 and Houston virus, genogroup
GII.4) were included as positive controls (5 µg ml⁻¹ and 10 µg ml⁻¹ starting
dilutions, respectively) because they have well-characterized haemagglutination
activity that is known to be mediated by interaction with histo-blood group antigens on
the surface of the red blood cells⁴³. The reaction was allowed to proceed for 1 h
at 4 °C before results were recorded. The titre recorded was the highest
dilution of sample that prevented the complete sedimentation of red blood cells
to the bottom of the well as compared to the negative controls. A commercial blood
typing antibody specific for the B antigen was purchased from Immucor and tested
at a starting dilution of 1:100 as another positive control.
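
The titre read-out described above (doubling dilutions, with the titre taken as the highest dilution that still prevents complete sedimentation) can be sketched as follows. This is an illustrative Python example only. The well scoring, the function name and the convention of reporting the titre as a reciprocal dilution factor are assumptions, not details given in the paper.

```python
def haemagglutination_titre(wells, start_factor=1):
    """Return the reciprocal of the highest positive dilution, or None.

    wells: sequence of booleans ordered from least to most dilute, where
           True means sedimentation was prevented (haemagglutination).
    start_factor: reciprocal dilution of the first well (for example, 100
           for a 1:100 starting dilution); each later well is diluted
           twofold relative to the one before it.
    """
    titre = None
    for index, positive in enumerate(wells):
        if positive:
            titre = start_factor * 2 ** index
    return titre


if __name__ == "__main__":
    # Hypothetical scoring of one doubling-dilution row.
    row = [True, True, True, False, False, False]
    print(haemagglutination_titre(row))
```

With triplicate rows, as used in the assay, the function would be applied to each row separately and the three titres compared.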


28. Pflugrath, J. W. The finer things in X-ray diffraction data collection. Acta 
Crystallogr. D 55, 1718-1725 (1999). 

29. Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta 
Crystallogr. D 67, 235-242 (2011). 

30. Evans, P. Scaling and assessment of data quality. Acta Crystallogr. D 62, 72-82 
(2006). 

31. McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40,
658-674 (2007).

32. Morris, R. J., Perrakis, A. & Lamzin, V. S. ARP/wARP and automatic interpretation
of protein electron density maps. Methods Enzymol. 374, 229-244 (2003).

33. Adams, P. D. et al. PHENIX: a comprehensive Python-based system for
macromolecular structure solution. Acta Crystallogr. D 66, 213-221 (2010).

34. Bohne, A., Lang, E. & von der Lieth, C. W. SWEET - WWW-based rapid 3D
construction of oligo- and polysaccharides. Bioinformatics 15, 767-768 (1999).

35. Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of
Coot. Acta Crystallogr. D 66, 486-501 (2010).

36. Lütteke, T., Frank, M. & von der Lieth, C. W. Carbohydrate Structure Suite (CSS):
analysis of carbohydrate 3D structures derived from the PDB. Nucleic Acids Res.
33, D242-D246 (2005).

37. Wallace, A. C., Laskowski, R. A. & Thornton, J. M. LIGPLOT: a program to generate 
schematic diagrams of protein-ligand interactions. Protein Eng. 8, 127-134 
(1995). 

38. Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory 
research and analysis. J. Comput. Chem. 25, 1605-1612 (2004). 

39. Smith, D. F., Song, X. & Cummings, R. D. Use of glycan microarrays to explore 
specificity of glycan-binding proteins. Methods Enzymol. 480, 417-444 (2010). 

40. Haselhorst, T. et al. Sialic acid dependence in rotavirus host cell invasion. Nature 
Chem. Biol. 5, 91-93 (2009). 

41. Crawford, S. E. et al. Rotavirus viremia and extraintestinal viral infection in the
neonatal rat model. J. Virol. 80, 4820-4832 (2006). 

42. Guillon, P. et al. Inhibition of the interaction between the SARS-CoV spike protein 
and its cellular receptor by anti-histo-blood group antibodies. Glycobiology 18, 
1085-1093 (2008). 

43. Hutson, A. M., Atmar, R. L., Marcus, D. M. & Estes, M. K. Norwalk virus-like particle
hemagglutination by binding to H histo-blood group antigens. J. Virol. 77, 
405-415 (2003). 




doi:10.1038/nature10997 


Restoration of vision after transplantation of photoreceptors


R. A. Pearson, A. C. Barber, M. Rizzi, C. Hippert, T. Xue, E. L. West, Y. Duran, A. J. Smith, J. Z. Chuang, S. A. Azam,
U. F. O. Luhmann, A. Benucci, C. H. Sung, J. W. Bainbridge, M. Carandini, K.-W. Yau, J. C. Sowden & R. R. Ali


Cell transplantation is a potential strategy for treating blindness 
caused by the loss of photoreceptors. Although transplanted rod- 
precursor cells are able to migrate into the adult retina and differ- 
entiate to acquire the specialized morphological features of mature 
photoreceptor cells’, the fundamental question remains whether 
transplantation of photoreceptor cells can actually improve vision. 
Here we provide evidence of functional rod-mediated vision after 
photoreceptor transplantation in adult Gnat1⁻/⁻ mice, which lack
rod function and are a model of congenital stationary night blind- 
ness’. We show that transplanted rod precursors form classic triad 
synaptic connections with second-order bipolar and horizontal 
cells in the recipient retina. The newly integrated photoreceptor 
cells are light-responsive with dim-flash kinetics similar to adult 
wild-type photoreceptors. By using intrinsic imaging under scotopic 
conditions we demonstrate that visual signals generated by trans- 
planted rods are projected to higher visual areas, including V1. 
Moreover, these cells are capable of driving optokinetic head tracking
and visually guided behaviour in the Gnat1⁻/⁻ mouse under
scotopic conditions. Together, these results demonstrate the feasibility 
of photoreceptor transplantation as a therapeutic strategy for 
restoring vision after retinal degeneration. 

So far there have been no convincing reports of photoreceptor-cell 
transplantation actually improving the recipient’s vision. This may be 
due to the relatively low numbers of new rod photoreceptors success- 
fully transplanted in previous studies (typically fewer than 1,000 
cells)'**. To establish that new rods truly can improve vision, we 
optimized the rod-transplantation procedure to increase the number 
of newly integrated photoreceptor cells in wild-type mice. The donor- 
cell population was rod-photoreceptor precursors identified by their 
expression of green fluorescent protein (GFP) under control of the 
promoter for the rod-specific transcription factor, Nrl’. We obtained 
maximum integration together with optimal recipient retinal histology 
after the transplantation of 200,000 fluorescence-activated cell sorted 
Nrl-GFP⁺ rod precursors, taken from postnatal day (P) 4-8 Nrl-GFP
donor mice, by subretinal injection to both the superior and the 
inferior retina. This resulted in a 20- to 30-fold increase in the number 
of integrated rod photoreceptors compared with previous studies, with 
up to 26,000 new rods within the outer nuclear layer of recipient adult 
wild-type mice (16,759 ± 1,705 cells; Fig. 1a and Supplementary Fig. 1).
Integrated cells were found predominantly around the injection sites, 
but were distributed over more than 50% of the retinal area. Thus, up to 
16% of donor cells integrated into the host outer nuclear layer (see 
Supplementary Information for estimation of integrated cell number). 
Supplementary Fig. 1 summarizes how this was achieved. 

To test the functionality of the transplanted rod photoreceptors, we
selected a murine model in which improvement of rod vision could be
assessed definitively; the Gnat1⁻/⁻ mouse lacks rod α-transducin
(Gnat1), a protein essential for rod phototransduction. It has no rod
function’ or behavioural responses to scotopic visual stimuli; it displays,
however, near-normal cone histology’, function and behavioural
responses to photopic visual stimuli (Supplementary Fig. 2 and refs 2, 9)
and shows no loss of cones with time®. Nrl-GFP⁺ rod precursors
integrated into the adult Gnat1⁻/⁻ retina in numbers very similar to those
observed in wild-type recipients (18,300 ± 1,474; maximum 32,015
integrated cells; Fig. 1a). Integrated cells were appropriately located
within the outer nuclear layer and morphologically very similar to
wild-type rods (Fig. 1b-i and Supplementary Fig. 3), correctly
expressing rod α-transducin (Fig. 1c, d), which is absent in endogenous
Gnat1⁻/⁻ rods, together with all other mature rod markers examined
(rod arrestin (Fig. 1e, f), rhodopsin, recoverin, phosducin (data not
shown)). They demonstrated outer segment formation (Fig. 1b-f)
and appropriate light-dependent translocation of rod α-transducin
(Fig. 1c, d) and arrestin (Fig. 1e, f)’® between the outer segments and
the cell body/synapse. Most (>80%) of Nrl-GFP⁺ rods located within
the outer nuclear layer displayed synaptic boutons. All those examined
by immunohistochemistry (15-20 cells per marker) appropriately
expressed the rod ribbon synapse proteins ribeye, bassoon and/or
dystrophin”’ (Fig. 1g-i). Finally, ultrastructural analysis confirmed that
integrated Nrl-GFP⁺ rods typically formed the classic triad synapse
with endogenous horizontal and bipolar neurons (Fig. 1j, k; synapses
observed for 51/55 cells examined; see Supplementary Information).

To test for light responses in Nrl-GFP⁺ rods correctly integrated
within Gnat1⁻/⁻ recipients, we performed suction-pipette
recordings’ (Fig. 2a, b). Pigment bleached by the GFP-excitation light used
during Nrl-GFP⁺ cell searching was regenerated with 9-cis-retinal
before recording (see Supplementary Information). To control for
the effects of the bleaching/regeneration procedure on the speed of
the dim-flash response kinetics and the sensitivity of wild-type rods
(Fig. 2b), we also recorded from non-injected wild-type retinae with
and without the excitation-light/pigment-regeneration procedure
(Fig. 2a, b). All recorded Nrl-GFP⁺ rods with an intact inner segment,
cell body and synaptic terminal showed light responses (n = 9/9;
Fig. 2a, b). Their saturated photocurrent (4.6 ± 0.5 pA) was similar
to that of wild-type rods with or without bleaching/regeneration
treatment (Fig. 2b; 6.4 ± 0.7 and 6.1 ± 0.8 pA, respectively). Dim-flash
response kinetics, indicated by time-to-peak (t_peak) and integration
time (t_i), were similar between Nrl-GFP⁺ and wild-type rods subjected
to bleaching/regeneration (see Fig. 2b). Individual Nrl-GFP⁺ rods
showed variations in sensitivity, with one cell being as sensitive as
bleached/regenerated wild-type rods and the other three over a log
unit less sensitive (Fig. 2b). This variation may be real, or simply reflect
the small sample size (owing to technical challenges in obtaining this
parameter in bleached/regenerated Nrl-GFP⁺ rods). For each
Nrl-GFP⁺ rod recorded, several adjacent GFP⁻ rods were also tested;
none gave detectable responses (Fig. 2c), thus ruling out light signals
originating from cones in the Gnat1⁻/⁻ retina.
Electroretinography (ERG) provides a gross measure of light-mediated
trans-retinal function, averaged across the whole retina.
Gnat1⁻/⁻ mice received dual transplants of Nrl-GFP⁺ rod precursors
into one eye and equivalent sham injections (age-matched Gnat1⁻/⁻
cells or vehicle) or no injection into the contralateral eye (Supplementary
Fig. 4). ERGs were recorded weekly 3-6 weeks after transplantation
using a double-masked protocol (see Supplementary Information),
before the eyes were assessed for Nrl-GFP⁺ rod-photoreceptor
integration. Despite robust integration (7,135-26,616 integrated cells per
eye; n = 6), ERG responses were not detected upon scotopic stimulation
of treated Gnat1⁻/⁻ animals (Supplementary Fig. 4a). In contrast,
Figure 1 | Improved transplantation protocols
significantly improve photoreceptor integration
into the adult Gnat1⁻/⁻ model of retinal
dysfunction. a, Nrl-GFP⁺ rod-photoreceptor
integration using new and previously published
protocols'; mean ± s.e.m., analysis of variance
(ANOVA); n, number of eyes. b, Typical example
of integrated Nrl-GFP⁺ rods (green). Scale bar, 50 µm.
c-f, Integrated Nrl-GFP⁺ rods expressed
rod α-transducin (c, d; red) and rod-arrestin (e, f; red)
and demonstrated correct, counter-directional
light-mediated translocation of these proteins
(bottom panels in c-f, respectively). g-i, Integrated
Nrl-GFP⁺ rods formed spherule synapses and
expressed dystrophin (g), bassoon (h) and ribeye
(i) (all red). Inserts, 3 µm projections of regions
marked. IS, inner segments; OS, outer segments;
ONL, outer nuclear layer; OPL, outer plexiform
layer; INL, inner nuclear layer. Scale bar, 10 µm.
j, Electron micrographs of low- (inset) and
high-power views (consecutive sections), showing
Nrl-GFP⁺ rod terminals (tT1, tT2). tT1 formed classic
triad with horizontal axon terminals (H, orange)
and bipolar dendritic terminal (B, red).
Endogenous rod terminals (nT; blue) were DAB
negative. Arrows, synaptic ribbons; scale bars, 500 nm.
k, Schematic of rod-triad synapse, reproduced
with permission from Webvision (H. Kolb et al.,
http://webvision.med.utah.edu/).


such responses were readily recordable in wild-type animals
(Supplementary Fig. 4f). These results were not unexpected because control
experiments with an adeno-associated (AAV2/8) viral vector to
deliver a Gnat1 transgene showed that a scotopic ERG response was
recordable only after transduction of approximately 150,000
photoreceptor cells, but not of 60,000 rods (Supplementary Fig. 4d-g).
Robust responses were observed in all groups after photopic
stimulation with no significant differences between sham and
Nrl-GFP⁺-treated eyes (Supplementary Fig. 4h). Thus, proper functioning of all
retinae was maintained throughout and was not impaired by the
transplantation procedures.


Figure 2 | Transplanted Nrl-GFP⁺ rod
photoreceptors are light-responsive and project
light information to the visual cortex. a, Flash-response
families from single dark-adapted wild
type (black), bleached/regenerated wild type (red)
and bleached/regenerated Nrl-GFP⁺ (blue) rods.
See Supplementary Information for flash
intensities. b, Comparison of saturated response
amplitude and dim-flash response parameters.
Individual data points are shown (open circles)
with mean ± s.e.m. (ANOVA, top three panels) or
median ± range (Kruskal-Wallis, bottom panel);
n, number of cells; , half-saturating flash intensity.
Not all parameters were obtainable for each cell.
c, Representative retinal slice showing
light-sensitive Nrl-GFP⁺ rod (red circle) and
surrounding non-responsive GFP-negative rods
(white circles). d, Schematic of optical intrinsic
imaging set-up (see Supplementary Information).
Au, auditory cortex; S1, somatosensory cortex.
e, Visual stimuli (Stim) (1) elicited optical signals in
V1 only from Nrl-GFP⁺-treated Gnat1⁻/⁻ (5) and
wild type (2), but not untreated (3) or
sham-injected (4), Gnat1⁻/⁻ eyes. Far right, overlapping
parts of the four stimuli were colour-coded (1).
Only responses present for two overlapping stimuli
were considered genuine sensory-evoked signals
and represented with the corresponding colour
(2-5). Scale bar, 1 mm.
Given the magnification of retinal signals shown by the visual
pathway, we reasoned that photoresponses recorded from
transplanted Nrl-GFP⁺ rods, although too small in amplitude or too few
in number to be detected by ERG, could be transmitted to the brain
and generate activity in defined areas of the visual cortex. We therefore
performed optical intrinsic imaging’*’* of primary visual cortex
(Fig. 2d, e and Supplementary Fig. 2b). Four overlapping stimuli
covering most of the visual field of the stimulated eye were used and
consisted of flickering black/white bars on a grey background
(Fig. 2e, row i). When presented to wild-type mice (n = 5), such stimuli
always evoked strong signals in well-defined areas of the visual cortex
under both scotopic (Fig. 2e, row ii) and photopic (Supplementary Fig. 2b)
conditions. Normal retinotopy was observed, as indicated by the different
cortical locations of the responses to each stimulus (colour maps; Fig. 2e
and Supplementary Fig. 2b). In contrast, no detectable response was
observed after scotopic stimulation of untreated Gnat1⁻/⁻ mice (n = 7)
(Fig. 2e, row iii, and Supplementary Figs 2b and 5). We assessed the
impact of transplantation by giving Gnat1⁻/⁻ animals Nrl-GFP⁺ rod
precursors or age-matched Gnat1⁻/⁻ (sham) cells, as before. We
observed no cortical activity in response to scotopic visual stimuli
presented to sham-injected eyes (n = 3; Fig. 2e, row iv, and Supplementary
Fig. 5), demonstrating that neither injection procedure nor
presence of non-functional cells led to false signals in the neocortex.
However, presentation of the same stimuli to eyes receiving Nrl-GFP
transplants led to robust responses in all animals (n = 5; Fisher’s exact
test P = 0.0175) (Fig. 2e, row v, and Supplementary Fig. 5); partly
overlapping stimuli gave responses in likewise partly overlapping areas
of the neocortex, with some preservation of retinotopy in the areas of
integration (colour maps; Fig. 2e, row v, and Supplementary Fig. 5).
Robust integration (10,782-31,075 cells) was observed in all animals
displaying cortical activity. Because transplanted cells predominantly
integrate near the injection site, it is reasonable that only some stimuli
led to a reliable cortical response (compare stimuli 1, 3, 4 with stimulus
2, Fig. 2e, row v). Whenever possible, after using scotopic stimuli we
assessed photopic stimuli; such stimuli elicited clear responses in all
groups.
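
As an illustration of how a Fisher's exact test applies to response counts of this kind, the Python sketch below computes a two-sided P value for a 2 × 2 table. The table used in the example (5 of 5 transplanted eyes responding versus 0 of 3 sham-injected eyes) is an assumption about which groups were compared; the text does not state the exact contingency table, so the result need not match the reported value exactly. The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more probable than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2

    def table_probability(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    observed = table_probability(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    return sum(
        table_probability(x)
        for x in range(lowest, highest + 1)
        if table_probability(x) <= observed * (1 + 1e-9)
    )


if __name__ == "__main__":
    # Assumed table: rows = treated / sham eyes, columns = response / no response.
    p_value = fisher_exact_two_sided(5, 0, 0, 3)
    print(f"two-sided P = {p_value:.4f}")
```

Under this assumed table the probability of the observed outcome is 1/56, which is of the same order as the value quoted in the text.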

To address the paramount question of whether photoreceptor 
transplantation confers improvements in vision, we assessed visual 
function first by measuring optomotor head-tracking responses to a 
rotating grating (Fig. 3a, b; see Supplementary Information)”’*"*. 


Figure 3 | Rescue of scotopic optokinetic head-tracking
behaviour in Nrl-GFP-treated
Gnat1⁻/⁻ mice. a, The Optomotry™ set-up (see
Supplementary Information). b, Measures of visual
function include contrast sensitivity and visual
acuity. c, d, Scotopic contrast sensitivity and visual
acuity threshold measurements for Nrl-GFP-
(green bars) or sham-(Gnat1⁻/⁻) treated
Gnat1⁻/⁻ eyes, and the averages of left and right
eyes for Gnat1⁻/⁻ mice receiving original protocol
transplants’ or sham injections to both eyes, or
untreated Gnat1⁻/⁻ or wild type (white bars)
controls. OR, optomotor response. Paired t-test.
e, f, Scatter plots of contrast sensitivity and visual
acuity against integrated Nrl-GFP⁺ rod number.
g, h, Photopic contrast sensitivity and visual acuity
for Nrl-GFP- (light grey) and sham-treated (dark
grey) eyes before and after transplantation. Means
± s.e.m.; n, number of animals.
Gnat1−/− mice received dual transplants of Nrl-GFP⁺ rod precursors
into one eye and age-matched Gnat1−/− cells into the contralateral
eye, as previously described. Additional cohorts received (1) the same,
but using our original protocol’, (2) dual Gnat1−/− sham injections to
both eyes or (3) no injection. As expected, no head-tracking behaviour
was observed in any Gnat1−/− mouse at scotopic luminance levels
(Fig. 3c, e) and normal head tracking occurred under photopic
conditions (Supplementary Fig. 2c, d) before transplantation. After
transplantation, scotopic optomotor responses were only seen in
Nrl-GFP-treated eyes; mean contrast-sensitivity threshold for Nrl-
GFP-treated animals was 1.3 ± 0.1 compared with 15.0 ± 0.6 in wild
type (Fig. 3c). Murine visual acuity is poorer under scotopic than
photopic conditions (0.245 ± 0.01 versus 0.488 ± 0.003 cycles per
degree in wild type; see also ref. 9) but a visual-acuity threshold of 0.097
± 0.007 cycles per degree was recorded in Nrl-GFP-treated Gnat1−/−
mice under the same conditions (Fig. 3e). Moreover, there was a significant
positive correlation between both contrast sensitivity (F-test,
P < 0.01; Fig. 3d) and visual acuity (F-test, P < 0.05; Fig. 3f) and the
number of integrated Nrl-GFP⁺ rods. Photopic optomotor responses,
assessed after scotopic testing, remained unchanged (Fig. 3g, h).
Finally, we assessed vision using the visually guided water-maze
test'’, which requires cognitive processing of visual information to
associate a grating with escape from a Y-shaped water maze. Under
photopic conditions, Gnat1−/− mice learned and completed the task
as well as wild type (≥70% correct responses; Fig. 4a, Supplementary
Fig. 2e-g and Supplementary Information) but performed no better
than chance under scotopic conditions. Gnat1−/− mice then received
(1) dual injections of Nrl-GFP⁺ rod precursors to both eyes, (2) sham


Figure 4 | Nrl-GFP-treated Gnat1−/− mice can
solve the visually guided water-maze task under
scotopic conditions. a, Schematic of water-maze
apparatus (adapted from ref. 22; see Supplementary
Information). Mice were trained to associate
striped grating with escape from water by a hidden
platform. An animal ‘passes’ a trial by crossing the
red line (decision point) on the side of the divider
with the striped grating. b, Pass rate of Nrl-GFP-
treated (black), sham-injected (dark grey) and non-
from Nrl-GFP-treated (green) and wild-type (light
grey) groups. f, Swim-time latencies (time-to-
platform) for all (light grey) and correct choice-
only (dark grey) trials. g, Ability to solve water-
maze task plotted against integrated Nrl-GFP
photoreceptor number. h, Examples of integration
in animals that successfully (top; Nrl-GFP-treated,
number 6) or unsuccessfully (bottom; Nrl-GFP-
treated, number 5) solved the task, as indicated in
g (circled, red). Scale bar, 100 μm. i-k, Pass rate
(i), visual acuity (j) and contrast sensitivity (k) for
Nrl-GFP-treated (light grey bars) and sham-
injected (dark grey bars) Gnat1−/− mice before and
after transplantation under photopic conditions.
injections to both eyes or (3) remained untreated. After transplantation,
four out of nine Gnat1−/− animals receiving Nrl-GFP⁺ rod precursors
correctly completed the task in at least 70% of trials under
scotopic conditions (green bars, Fig. 4b; Supplementary Movie). In
contrast, all sham and untreated Gnat1−/− mice performed no better
than chance (50%). As a group, Nrl-GFP-treated animals were significantly
better at solving the water-maze task than either control group
(Fig. 4c). The four Nrl-GFP-treated Gnat1−/− animals able to solve the
task correctly further had a contrast sensitivity of 1.3 ± 0.04 and visual
acuity of 0.09 ± 0.005 cycles per degree (compared with contrast
sensitivity of 3.0 ± 0.24 and visual acuity of 0.206 ± 0.005 cycles
per degree, respectively, for wild-type controls; Fig. 4d, e). These same
animals solved the task with an average swim time of 15.1 ± 0.8 s
(correct choice swim-time latency 9.6 ± 0.8 s), compared with 26.4 ± 1.1,
26.0 ± 1.2 and 7.1 ± 1.4 s for sham-injected Gnat1−/−, untreated
Gnat1−/− and wild-type mice, respectively (Fig. 4f). There was a significant
positive correlation between ability to solve the task and integrated
Nrl-GFP⁺ rod number (F-test, P < 0.01; Fig. 4g, h). Cell integration in
those animals that performed best in this task was often clustered, rather
than widely distributed across the retina, suggesting that proximity of
integrated cells to one another may be important, in addition to absolute
number. For example, Nrl-GFP-treated animals numbers 1 and 2 had
similar numbers of integrated rod photoreceptors but number 1 had
qualitatively more closely grouped clusters of cells (data not shown).
Ability to perform the water-maze task under photopic conditions,
assessed after scotopic testing, remained unchanged in all groups after
transplantation (Fig. 4i-k).
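
The 70% pass criterion can be put in context with a short calculation. The following is a minimal sketch, assuming a two-choice maze in which a guessing animal is correct with probability 0.5 on each independent trial; it estimates how often such an animal would reach the criterion by luck alone. The trial count (`N_TRIALS = 20`) is a hypothetical value chosen for illustration, since the number of trials per animal is not stated in this section, and the function name is likewise illustrative.

```python
from math import ceil, comb


def prob_pass_by_chance(n_trials: int, criterion: float = 0.70,
                        p_correct: float = 0.5) -> float:
    """Binomial tail: probability that a guessing animal meets the criterion."""
    # Smallest number of correct trials that satisfies the criterion.
    # The small epsilon guards against floating-point overshoot in the product.
    k_min = ceil(criterion * n_trials - 1e-9)
    return sum(
        comb(n_trials, k) * p_correct**k * (1 - p_correct) ** (n_trials - k)
        for k in range(k_min, n_trials + 1)
    )


N_TRIALS = 20  # hypothetical; not taken from the text
p = prob_pass_by_chance(N_TRIALS)
print(f"P(>=70% correct by guessing over {N_TRIALS} trials) = {p:.4f}")
```

Under these assumptions the probability shrinks as the number of trials grows, which is why a fixed pass criterion becomes more stringent with longer test sessions.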

Although the scotopic visual function recorded here in transplanted
Gnat1−/− mice is lower than in wild-type animals, such sensitivity is
still impressive given that integrated photoreceptors account for less 
than 1% of the total rods in the retina. However, other studies have 
demonstrated a remarkable sensitivity from very few light-sensitive 
cells'’, whereas clinical studies have shown useful vision despite there 
being no detectable ERG response from the retina’’. Many additional 
steps are required before these findings can be translated to the clinic, 
but the results presented here demonstrate for the first time that 
transplanted rod-photoreceptor precursors can integrate into a dys- 
functional adult retina and, by directly connecting with the host retinal 
circuitry, truly improve vision. 


METHODS SUMMARY 


The transplantation procedures were performed as described previously'’** with 
modifications, as described in Supplementary Fig. 1 and Supplementary 
Information. All recipients were adult (6-8 weeks) at time of transplantation 
and functional assessments were made 4-6 weeks after transplantation. Cellular,
retinal and behavioural assessments of visual function were made using suction-
pipette recordings'*”°, ERGs”, optical imaging'*"*, optomotor”*”*° and water-maze’”
testing and previously published protocols with amendments as detailed in
Supplementary Information.
Crowding induces live cell extrusion to maintain
homeostatic cell numbers in epithelia 


George T. Eisenhoffer'*, Patrick D. Loftus'*, Masaaki Yoshigi*, Hideo Otsuna’, Chi-Bin Chien’, Paul A. Morcos* & Jody Rosenblatt! 


For an epithelium to provide a protective barrier, it must maintain 
homeostatic cell numbers by matching the number of dividing cells 
with the number of dying cells. Although compensatory cell division 
can be triggered by dying cells’, it is unknown how cell death might 
relieve overcrowding due to proliferation. When we trigger apoptosis 
in epithelia, dying cells are extruded to preserve a functional barrier’. 
Extrusion occurs by cells destined to die signalling to surrounding 
epithelial cells to contract an actomyosin ring that squeezes the dying 
cell out*®. However, it is not clear what drives cell death during 
normal homeostasis. Here we show in human, canine and zebrafish 
cells that overcrowding due to proliferation and migration induces 
extrusion of live cells to control epithelial cell numbers. Extrusion of 
live cells occurs at sites where the highest crowding occurs in vivo and 
can be induced by experimentally overcrowding monolayers in vitro. 
Like apoptotic cell extrusion, live cell extrusion resulting from 
overcrowding also requires sphingosine 1-phosphate signalling 
and Rho-kinase-dependent myosin contraction, but is distinguished 
by signalling through stretch-activated channels. Moreover, disruption
of a stretch-activated channel, Piezo1, in zebrafish prevents
extrusion and leads to the formation of epithelial cell masses. Our 
findings reveal that during homeostatic turnover, growth and 
division of epithelial cells on a confined substratum cause over- 
crowding that leads to their extrusion and consequent death owing 
to the loss of survival factors. These results suggest that live cell 
extrusion could be a tumour-suppressive mechanism that prevents 
the accumulation of excess epithelial cells. 

To examine how cells are eliminated during homeostasis, we 
immunostained human colon tissues, developing zebrafish epidermises, 
and cultured Madin-Darby canine kidney (MDCK) epithelial cells for
active caspase-3 to identify apoptotic cells, and actin and DNA to 
highlight cell borders and extrusion**. Surprisingly, we found that 
predominantly caspase-negative cells extrude from the surfaces of 
colon epithelia (80%, n = 46 extruding cells), from zebrafish epidermis
(88%, n = 160 extruding cells in three experiments), and from overgrown
monolayers in culture (67%, n = 300 extruding cells in three
experiments; Fig. 1a, b, d, e, g, h). By contrast, extrusions resulting from
inducing apoptosis in zebrafish with G418 or cultured monolayers 
with ultraviolet-C were almost exclusively caspase-3 positive’. These 
findings suggest that during homeostasis and development, live rather 
than dead cells are eliminated by extrusion. Similar live cell extrusions 
have been observed during mammary gland involution’. Because these 
non-apoptotic in vivo extrusions looked identical to apoptotic extru- 
sion, we hypothesized that the extrusion pathway operates in two 
diverging manners: one to maintain homeostatic cell numbers in 
epithelia and the other to remove apoptotic/damaged cells. 

When quantifying extrusion in both adult human colon and 
developing zebrafish epidermis, we noticed that extrusions occurred 
predominantly at the fin edges and colon surfaces, where cell densities 
were highest (1.7- and 1.8-fold higher than fin centre or crypt side
(Supplementary Fig. 1a, b), respectively, indicated by yellow cells in
Fig. 1c, f). Extrusion zones were also more curved in vivo; yet, because
they occurred most frequently in regions with higher cell density (1.8- 
fold) compared to proliferative regions in flat cell culture monolayers 
(Fig. 1i and Supplementary Fig. 1c), we decided to experimentally test 
if overcrowding strain could induce cells to extrude. 

To simulate overcrowding observed in vivo, we grew MDCK cells to 
confluence on a silicone membrane stretched to 28% of its original 
length and then released it from stretch (Fig. 2a). Within 30 min after 
release, the number of cells per 100 μm² increased 1.3-fold, from
112 ± 5 to 144 ± 4 (Fig. 2b-f). ZO-1 and β-catenin staining confirmed
that tight and adherens junctions were still intact and that the average
cell diameter had decreased by 30 min after crowding (Fig. 2g-i and
Supplementary Fig. 2). Moreover, crowded monolayers maintained
adhesion to Cy-5-labelled fibronectin-coated membranes (Fig. 2j). By
6 h after crowding, the number of cells per field equilibrated to pre-release
levels (110 ± 10 cells per 100 μm², see Fig. 2e, f), indicating that
MDCK epithelia eliminate cells to achieve homeostatic cell numbers. 
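
The reported density change can be checked against what membrane release alone would predict. The sketch below is a rough consistency check, not the authors' analysis: it assumes a purely uniaxial 28% stretch, no lateral (Poisson) contraction of the membrane, and conservation of cell number over the first 30 min. The densities are the mean values quoted above; the function names are illustrative.

```python
# Mean densities quoted in the text (cells per field).
DENSITY_BEFORE_RELEASE = 112.0
DENSITY_30MIN_AFTER = 144.0
DENSITY_6H_AFTER = 110.0
STRAIN = 0.28  # fractional stretch of the membrane before release


def expected_fold_increase(strain: float) -> float:
    """Density gain when a uniaxially stretched substrate returns to rest length.

    Assumption: length shrinks from L*(1 + strain) to L, width is unchanged,
    and the number of attached cells stays constant.
    """
    return 1.0 + strain


def fold_change(before: float, after: float) -> float:
    """Ratio of two densities."""
    return after / before


expected = expected_fold_increase(STRAIN)
observed = fold_change(DENSITY_BEFORE_RELEASE, DENSITY_30MIN_AFTER)
recovery = fold_change(DENSITY_BEFORE_RELEASE, DENSITY_6H_AFTER)

print(f"expected from release: {expected:.2f}-fold")
print(f"observed at 30 min:    {observed:.2f}-fold")
print(f"6 h relative to start: {recovery:.2f}-fold")
```

Under these assumptions the observed increase is close to the geometric expectation, and the 6 h value sits near the starting density, in line with the 1.3-fold figure and the return to pre-release levels reported in the text.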
Figure 1 | Both apoptotic and non-apoptotic epithelial cells extrude at
locations of high crowding during homeostasis and development.
a, b, d, e, g, h, Non-apoptotic (caspase-negative, arrowheads) and apoptotic
(caspase-positive, arrows) extrusions at the surface of human colon epithelia
(a, b), zebrafish epidermis (d, e), and confluent MDCK monolayers
(g, h). c, f, i, Extrusions typically occur at sites of high crowding (indicated by
yellow cells). Arrows indicate direction of force from mitoses and migration.
Scale bars, 10 μm.
Figure 2 | Characterization of experimental overcrowding. a, Experimental
design for overcrowding monolayers grown on stretched matrix. Only cells in
the grey area (right) were analysed to ensure a homogeneous strain field.
b-f, Images of the nuclei of control unstretched monolayers and monolayers
0.5-6 h after overcrowding (b-e) were quantified in f. f, Mean number of nuclei
per field are plotted, error bars represent s.e.m. g-i, Monolayers at 0.5 and 2 h
after release show that adherens and tight junctions maintain integrity after
overcrowding, as compared to control. β-cat, β-catenin. j, Monolayer stained
for actin and DNA at 2 h after release on Cy5-labelled fibronectin shows that
contacts to substratum are maintained. Scale bars, 10 μm. ***P < 0.0005,
**P < 0.005, *P < 0.05.
Figure 3 | Overcrowding MDCK monolayers promotes extrusion of non-
apoptotic cells. a, Quantification of live and apoptotic extruding cells 2 h after
overcrowding. b, Live cell extrusion requires ROCK, sphingosine kinase (SK)
and S1P₂. c, Cell viability 2 or 24 h after extrusion. UV, ultraviolet. d, Growth of
extruded cells collected at 2 h post-crowding after 24 h and 5 days (d5).
e-g, Gd³⁺ inhibits both overcrowding-induced extrusion (e) and homeostatic
extrusion (g) whereas BCL2 overexpression and JNK inhibition only affect
ultraviolet-C-induced apoptotic extrusion (f). n = 7 independent experiments,
mean ± s.e.m. Scale bars, 100 μm. ***P < 0.0005, **P < 0.005, *P < 0.05.
Immunostaining for actin and active caspase-3 showed that overcrowding
induced predominantly non-apoptotic extrusion, as seen during
homeostasis (Fig. 3a). Filming crowded MDCK monolayers
expressing Lifeact-green fluorescent protein (GFP)* to highlight 
F-actin, or the active caspase-3 indicator NucView, confirmed that live 
cells are extruded by contracting actin rings (Supplementary Movie 1 and 
Supplementary Fig. 4). The numbers of live extruding cells eventually 
decreased by 6h after overcrowding, concordantly with the decrease 
in cell densities. The percentage of non-apoptotic extrusion also 
correlated to the percentage of crowding (Supplementary Fig. 3). 

To determine whether live cell extrusion required the same factors 
that control apoptotic cell extrusion, we blocked signals required for 
extrusion during experimental overcrowding. We previously identified
sphingosine-1-phosphate (S1P) as the signal apoptotic cells
produce to activate the S1P receptor 2 (S1P₂), which triggers Rho-mediated
contraction to squeeze the dying cells out®. Inhibiting Rho
kinase (ROCK) or S1P signalling markedly decreased the percentages 
of both non-apoptotic and apoptotic cell extrusions (Fig. 3b). 
Although it was not possible to score non-apoptotic cells blocked from 
extrusion, as they look like any other cells within the monolayer, the 
significant decreases in the percentages of non-apoptotic cell extrusion 
indicated that live cell extrusion, like apoptotic cell extrusion, requires 
S1P signalling through ROCK-mediated actomyosin contraction. 

To assess the fate of overcrowding-induced extruded cells, we 
analysed their viability using flow cytometry and their ability to survive 
and proliferate by plating them on new substrata. After collecting 
extruded cells from the medium 2 h after crowding, 45.3 ± 2.1% were
alive, whereas only 21.3 ± 2.7% were alive after ultraviolet-induced
apoptotic extrusion (Fig. 3c and Supplementary Fig. 5a). Overcrowding
and homeostatically extruded cells were able to proliferate
into confluent monolayers, whereas those collected from apoptosis-
induced extrusion were not (Fig. 3d). If, however, cells extruded from
homeostatic or post-crowded monolayers were collected after 24 h,
most cells died instead of proliferating after re-plating (Supplementary
Fig. 5b), suggesting that extruded cells typically die unless a new
substratum is provided. Presumably, during in vivo homeostasis most 
extruded cells die from anoikis, apoptosis due to loss of survival signals 
from engagement with the underlying matrix”""’. 

We next investigated which signals trigger live cell extrusion during 
homeostasis or overcrowding. We previously found that overexpres- 
sion of BCL2 inhibits both apoptosis and extrusion in response to 
apoptotic stimuli’? (Fig. 3f); however, BCL2 overexpression did not 
block live cell extrusion during homeostasis or after overcrowding 
(Figs 3e, g). To investigate signals that might regulate live cell extrusion 
after overcrowding, we tested two candidates that are activated by cell 
stress: the c-Jun N-terminal kinase (JNK; also known as
MAPK8)” and stretch-activated channels'*’’. JNK inhibitor blocked
apoptotic extrusion in response to ultraviolet-C but not live cell extru- 
sion after overcrowding or homeostatic cell turnover (Figs 3e-g). In 
contrast, we found that inhibiting stretch-activated ion channels with
gadolinium (Gd³⁺)!*"” significantly reduced the percentage of both
apoptotic and non-apoptotic extrusion events after overcrowding or
during epithelial homeostasis, yet had no effect on apoptotic cell extrusion
in response to ultraviolet light (Fig. 3e-g). When cell extrusion
is blocked by Gd³⁺, the number of cells per 100 μm² remains high at
2 h after crowding (167 ± 12) compared to control crowded cells
(112 ± 5), demonstrating that extrusion relieves overcrowded cell
densities. Thus, whereas BCL2 and JNK control apoptosis-induced
extrusion, stretch-activated signalling controls live cell extrusion during
homeostasis that is induced by overcrowding, presumably
upstream of S1P signalling (Fig. 4l).

If stretch-activated signalling controls extrusion-mediated 
homeostatic cell turnover, blocking this pathway in vivo should lead 
to accumulation of excess epithelial cells. By immunostaining and film- 
ing the epidermis of developing zebrafish with Tg(cldnb:lynGFP)"*, we 
found that cells proliferate at 37 h post-fertilization (hpf) and migrate 
Figure 4 | Disrupting the stretch-activated channel Piezo1 in vivo blocks
live cell extrusion and causes epithelial mass formation. a, Proliferation and
extrusion rates during zebrafish tail fin development. H3P, phospho-histone
3. b, Gd³⁺ treatment and piezo1 photo-morpholino (photo-MO) block
extrusion (n = 15-17 embryos from three independent experiments,
mean ± s.e.m.). c, d, f, g, i, j, Bright-field and confocal projections of
untreated (c, d), Gd³⁺-treated (f, g), and piezo1 photo-morpholino
knockdown (i, j) in 60 hpf developing zebrafish. DIC, differential interference
contrast. e, h, Tracks from red circle mark the movement of individual
epithelial cells in control and Gd³⁺-treated Tg(cldnb:lynGFP) developing
zebrafish. k, Epidermal cell mass from a piezo1 photo-morpholino
knockdown in Kaede zebrafish, where yellow cells are photo-converted.
l, Model of induced cell death versus homeostatic cell death. Arrowheads
denote cell accumulations and asterisk an extrusion. Scale bars in
c, f, i, 100 μm; in d, g, j, 10 μm. ***P < 0.0005, **P < 0.005, *P < 0.05.


to the fin edges between 48-59 hpf, where they extrude at 56-60 hpf 
(Fig. 4a, c-e, Supplementary Fig. 6 and Supplementary Movie 2). 
Treating 32-hpf zebrafish with Gd³⁺ blocked homeostatic cell extrusion
and resulted in epidermal cell mass formation at the fin edges, the
number of which was dependent on Gd³⁺ concentration (Fig. 4b, f-h,
Supplementary Fig. 7 and Supplementary Movie 3). Live imaging of
Gd³⁺-treated Tg(cldnb:lynGFP) zebrafish revealed that the cell masses
formed at zones of convergence, where cells failed to extrude (Fig. 4h 
and Supplementary Movie 4). 

By knocking down candidate stretch-activated channels in zebrafish 
with morpholino oligonucleotides, we found that live cell extrusion 
requires the channel Piezo1 (ref. 14). To prevent developmental
defects from early knockdown (Supplementary Fig. 8a, b), we used a
photo-cleavable morpholino (photo-morpholino) to knock down Piezo1
at 30 hpf (Supplementary Fig. 8c, d)’’. By 60 hpf, Piezo1 morphants, like
Gd³⁺-treated fish, could not extrude cells and developed epidermal
masses at the fin edges and other sites of tissue strain (Figs 4b, i-k
and Supplementary Figs 9 and 10), indicating that Piezo1 regulates
extrusion to maintain homeostatic epithelial cell numbers in vivo. 

As extrusion seems to control cell numbers during homeostasis, we 
predict that alterations in extrusion could lead to various epithelial 
pathologies. Aberrant signalling or tensions that block extrusion could 
lead to formation of cell masses, as those produced in zebrafish when 
stretch-activated channels are disrupted. Furthermore, these cell 
masses may set the stage for tumorigenesis by allowing retention of 
oncogenic or defective cells. In support of this, we found increased cell 
densities and no clearly identified extrusions in colon polyp sections 
compared to control sections (Supplementary Fig. 11). Conversely, 
hyper-contraction of bronchial epithelia following bronchoconstric- 
tion in asthmatics could lead to excessive extrusion and characteristic 
epithelial denuding. The resulting poor barrier could lead to the increased 
inflammation and infection seen in asthmatics*””’. Whereas math- 
ematical models have suggested that mechanical forces could control 
tissue homeostasis”’, our results show that cell division and migration 
within epithelia cause overcrowding strain, which induces live cells to 
extrude and later die. Thus, homeostatic live cell extrusion provides the 
previously missing link between proliferation and epithelial cell death 
(Fig. 1c, f, i).


METHODS SUMMARY 


All cell cultures, cell staining, ultraviolet irradiation and imaging were done using 
MDCK cells, as previously described’. Cells were treated with 10 μM SKI-II
(Calbiochem), 10 μM JTE-013 (Tocris Bioscience), 10 μM SP600125, 10 μM
Gd³⁺ (both Sigma-Aldrich), or 1% DMSO as a control. Flow cytometry was done
using a Becton Dickinson FACScan after treating cells with 1 μg per 250 μl of
propidium iodide (Sigma-Aldrich). The Huntsman Cancer Institute Tissue
Resource and Applications Core provided human colon sections. Developing
wild-type AB zebrafish were treated with 10 mM Gd³⁺ at 28.5 °C from 32 hpf
until 60 hpf, and immunostained as described previously’, or filmed with a Nikon 
spinning disc confocal microscope using Andor software. The antisense morpho- 
lino oligonucleotides and photo-morpholino oligonucleotides were acquired from 
Gene Tools, LLC. For the photo-morpholino experiments, the translation block- 
ing antisense morpholino was mixed at a 1:1 molar ratio with a 25 bp sense 
photo-morpholino and injected into 1-2-cell-stage wild-type AB or Et(Gal4-
VP16)'°***, Tg(UAS-E1b:Kaede)"”' zebrafish embryos. At 28-32 hpf, embryos
were exposed to 350 nm light for 20 s to release the caging sense morpholino, then 
fixed and immunostained at 60 hpf. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Cell culture and overcrowding assays. MDCK II cells were grown as described’. 
A custom-designed Teflon chamber was fabricated to culture cells on flexible 
silicone membranes in stretched states (2 × 2 cm, 0.5 mm thickness)”*”*. Prior
to cell seeding, 2 × 2 cm silicone membranes were stretched and coated with
5 μg ml⁻¹ fibronectin (BD Biosciences) at 4 °C for 24 h. To stretch membranes
with our device, one edge of the silicone membrane was clamped in place, while the
other side was clamped to a movable shaft, which moves through a Teflon
chamber with a sealing gasket. The movable shaft was pulled out to lengths
representing 11%, 22% and 28% strain. 750,000 cells ml⁻¹ were plated onto the
2 × 2 cm silicone matrices in a stretched state in the device or in a culture dish (for
non-stretched controls) and grown to confluence, then released from their 
stretched states, thereby crowding cells in monolayer. Epithelial monolayers were 
fixed and stained or filmed using a Nikon 90i wide-field fluorescence microscope. 
BCL2-overexpressing cell lines are described previously’’. 

Drug and ultraviolet assays. Confluent MDCK monolayers on silicone 
membranes were pre-treated with SP600125 JNK inhibitor at 10, 50 and
100 μM or 10, 50 and 100 μM gadolinium(III) chloride (Gd³⁺) for 30 min at
37 °C. Apoptosis was induced by exposing monolayers to 1,200 μJ cm⁻²
ultraviolet (254 nm) irradiation in a UV series II (Spectroline). Cells were fixed
and stained from 1-6 h post-ultraviolet treatment depending on the experiment.
The minimal drug concentrations (10 μM) established in the ultraviolet-induced
extrusion assay were used in the overcrowding and homeostatic assays. For testing
extrusion following overcrowding, chambers were pretreated with 10 μM JTE-013
(Tocris Bioscience), 10 μM SKI II (Calbiochem) or 10 μM Y-27632 (Tocris) for
30 min before overcrowding until 2 h after crowding.

Cell immunostaining. Fixation and staining of MDCK cells for actin, DNA, 
active caspase-3, ZO-1 and β-catenin was carried out as previously described’.
Fibronectin was labelled with an Amersham Cy5 Ab Labelling Kit (GE 
Healthcare). 

Live cell imaging. Standard MDCK cells or Lifeact-GFP-expressing* MDCK cells
were grown to confluent monolayers and imaged on a Nikon 90i wide field 
fluorescent microscope with the stage kept at 37 °C. For the NucView experiments, 
cells were incubated with the activated caspase-3 indicator NucView at 1:200 for 
30 min before imaging. 

Colon sections. Human colon tissue samples were fixed, embedded in paraffin,
and cut into 10 μm sections by the Huntsman Cancer Institute Tissue Resource
and Applications Core. The sections were deparaffinised and rehydrated by
incubating in citrus clearing solvent (CCS; Richard Allen Scientific), 100%,
95%, 80%, 70% ethanol, and PBS. Antigens were retrieved by heating the slides
in 10 mM sodium citrate at 95 °C for 20 min, then rinsed three times with PBS,
blocked with 5% BSA/0.5% Tween-20 in PBS for 24 h, and incubated overnight
with active caspase-3 antibody, rinsed five times with PBS, incubated in Alexa-
488 anti-rabbit antibody, 1 μg ml⁻¹ Hoechst, and Alexa-568 anti-actin
antibody for 2 h, rinsed three times in PBS, and mounted in Prolong Gold
(Invitrogen). 

Microscopy. Images were captured on a CTR6000 microscope (Leica) with a ×63
oil lens using a Micromax charge-coupled device camera (Roper Scientific) or on a
Nikon Eclipse TE300 inverted microscope converted for spinning disc confocal
microscopy (Andor Technologies) using a ×20 or ×60 plan fluor 0.95 oil lens with
an electron-multiplied cooled CCD camera 1,000 × 1,000, 8 × 8 mm² driven by
the IQ software (Andor Technologies). Image J was used to stack 8-10 consecutive
1 μm confocal sections into Z series, which were then colour-combined and
reconstructed into a three-dimensional image using Metamorph (GE 
Healthcare). IP Lab 4.0.8 s and Image J software were used to analyse percentages 
of apoptosis and extrusion. For quantification of extrusion in tissue culture, 
extruding cells were manually scored as non-apoptotic extruding, apoptotic 
extruding, or blocked apoptotic extrusions based on the presence of an actin ring 
and caspase-3 staining in 10,000 monolayer cells. 

Statistical analyses. Statistical analysis was done on at least four independent 
experiments for the overcrowding control and time analysis. Three separate assays 
were used for drug treatments and for FACS analysis. The error bars in all figures 
are s.e.m. All P values were determined from a two-sided unpaired Student's t-test 
using GraphPad Prism software. 
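
For readers who want to see what these summary statistics involve, the following is a minimal sketch of the s.e.m. and the pooled-variance (Student's) unpaired t statistic. It is not the authors' analysis, which was run in GraphPad Prism; the replicate values below are hypothetical numbers invented for illustration, and the function names are illustrative. The two-sided P value would then be read from a t distribution with the returned degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev


def sem(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    return stdev(values) / sqrt(len(values))


def unpaired_student_t(a, b):
    """Return (t statistic, degrees of freedom) for an equal-variance t-test."""
    n_a, n_b = len(a), len(b)
    # Pooled variance weights each sample variance by its degrees of freedom.
    pooled_var = ((n_a - 1) * stdev(a) ** 2 + (n_b - 1) * stdev(b) ** 2) / (n_a + n_b - 2)
    std_err = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(a) - mean(b)) / std_err, n_a + n_b - 2


# Hypothetical replicate densities (cells per field); not data from the paper.
control = [110, 112, 114]
crowded = [142, 144, 146]

t_stat, dof = unpaired_student_t(control, crowded)
print(f"control: {mean(control):.1f} +/- {sem(control):.2f} (s.e.m.)")
print(f"crowded: {mean(crowded):.1f} +/- {sem(crowded):.2f} (s.e.m.)")
print(f"t = {t_stat:.2f}, degrees of freedom = {dof}")
```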

FACS and proliferation analysis. Confluent homeostatic, ultraviolet, and
overcrowding monolayers were rinsed three times with PBS to remove any previous
cells in suspension. The media was replaced and collected at each denoted
analysis time. Media was collected and centrifuged at 1,458g in a Damon/IEC
Division Clinical Centrifuge for 1 min. The cells were resuspended in 250 μl of
PBS containing 4 μg ml⁻¹ propidium iodide and analysed by FACS. Samples were
analysed on a Becton Dickinson FACScan and 50-1,000 cells were analysed for
propidium iodide fluorescence at each condition. For proliferation analysis,
pelleted cells were instead re-plated in a 96-well dish and representative pictures
were taken at 24 h and 5 days after re-plating. Results are from three independent
experiments.

Zebrafish care and maintenance. Adult zebrafish were maintained under 
standard laboratory conditions, with a regular light/dark cycle of 14 h light and
10 h of darkness. Embryos were collected and raised in E3 embryo medium at
28.5 °C and staged as previously described’®. The Institutional Animal Care and
Use Committee (IACUC) at the University of Utah (Animal Welfare Assurance
no. 10-07017) has approved all procedures performed in this protocol using the
zebrafish Danio rerio. 

Zebrafish drug treatments. Zebrafish embryos were dechorionated at 24 hpf. At 
32 hpf approximately 50-100 embryos were transferred to dishes containing E3 
embryo medium with or without (control) 10 mM Gd³⁺ and allowed to develop at
28.5 °C until 60 hpf, when the animals were either imaged live or fixed for
immunostaining. 

Zebrafish immunostaining. Developing zebrafish larvae were fixed in 4% 
formaldehyde for 1-2 h at room temperature (20 °C) or overnight at 4 °C.
Fixed specimens were then permeabilized by rinses with PBSTx (0.5%) and
incubated in blocking buffer (1% DMSO, 2 mg ml⁻¹ BSA, 0.5% Triton X-100
and 10% goat serum in PBS) for 2 h. Specimens were then incubated overnight in
primary antibodies for phospho-histone H3 (H3P; Abcam, 1:500) or activated
caspase-3 (BD Pharmingen, 1:700). Samples were subsequently washed six times
with PBSTx and then incubated in blocking buffer for 2 h before incubation with 
the appropriate secondary antibodies or Alexa-phalloidin (488/568/647) 
(Invitrogen). After incubation with secondary antibodies or Alexa-phalloidin, 
the specimens were rinsed five times with PBSTx, incubated with DAPI 
(1:1,000) for 15 min, rinsed once more and then mounted in Prolong Gold. 
Depending on the situation, either whole specimens or only tail fragments (from 
the yolk extension back) were mounted for imaging. For quantification of H3P- 
positive cells, the region from the cloaca back to the edge of the tail fin was 
quantified. 

Zebrafish imaging. Developing zebrafish were immunostained as described 
previously® and imaged with either the wide-field fluorescent microscope or a 
spinning disc confocal microscope, described above. For live imaging, wild-type 
or Gd³⁺-treated developing Tg(cldnb:lynGFP) zebrafish, a gift from T. Piotrowski,
were anaesthetized with 0.02% tricaine in E3, mounted in 1% low melt agarose and
imaged on a spinning disc confocal microscope using a ×20 objective, capturing a
z-series every 2 min for 3-6 h. Manual tracking of individual cells was performed 
using Metamorph. 

Morpholino antisense oligonucleotide knockdown of Piezo1. All of the
morpholino oligonucleotides and photo-morpholino oligonucleotides used in this
study were acquired from Gene Tools, LLC. Developing zebrafish embryos were
injected with 2-4 ng of a standard translation blocking antisense morpholino
oligonucleotide directed against the 5’ untranslated region (UTR) of piezo1
(accession XM_691263) at the 1-2-cell stage and then allowed to develop at
28.5 °C. For the photo-cleavable morpholino experiments, the translation blocking
antisense morpholino (TBMO) was mixed at a 1:1 molar ratio with a sense-photo
morpholino, with a 4 bp mismatch around the photo-linker (see Supplementary
Fig. 8c for schematic of strategy) and then injected into 1-2-cell-stage wild-type AB
or Et(Gal4-VP16)*1°; Tg(UAS-E1b:Kaede)'" embryos. At 28-32 hpf, the
developing embryos were then exposed to 350 nm light for 20 s using a ×10
objective on a Nikon 90i wide-field fluorescent microscope to activate the
morpholino. Some injected embryos were not converted as a control for ‘leakiness’ 
of the sense photo-morpholino. Likewise, some wild-type or Kaede-expressing 
embryos that were not injected were exposed to 350 nm light to ensure photo- 
conversion did not cause any adverse effects. Morpholinos: FAM38A 5’ UTR 
antisense morpholino oligonucleotide: GAGCGACACTTCCACTCACATT
CCT; FAM38A 5’ UTR sense photo-morpholino oligonucleotide: AGGAAT 
GTGAaaxttAGTGTCGCTC. 
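
The relationship between the two oligonucleotides can be checked directly. The sketch below is an illustrative check rather than part of the published protocol: it takes the two sequences as printed above (joined across the line breaks), computes the perfect reverse complement of the antisense morpholino and compares it with the sense photo-morpholino. It assumes that the lowercase `x` marks the photo-linker position and that the other lowercase letters are the deliberate mismatches; the function names are illustrative.

```python
# Sequences as printed in the Methods, joined across line breaks.
ANTISENSE = "GAGCGACACTTCCACTCACATTCCT"
SENSE_PHOTO = "AGGAATGTGAaaxttAGTGTCGCTC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA-alphabet sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def compare_to_perfect_sense(antisense: str, sense_photo: str):
    """Return (mismatch positions, linker positions), both 0-based."""
    perfect = reverse_complement(antisense)
    linker = [i for i, base in enumerate(sense_photo) if base.lower() == "x"]
    mismatches = [
        i
        for i, (expected, actual) in enumerate(zip(perfect, sense_photo.upper()))
        if i not in linker and expected != actual
    ]
    return mismatches, linker


mismatches, linker = compare_to_perfect_sense(ANTISENSE, SENSE_PHOTO)
print("perfect sense strand:", reverse_complement(ANTISENSE))
print("mismatch positions:  ", mismatches)
print("linker position(s):  ", linker)
```

Under these assumptions both oligonucleotides are 25 positions long, and the sense strand differs from a perfect complement at four positions flanking the linker, which matches the 4 bp mismatch described in the text.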

Western blot analyses of Piezo1. piezo1 TBMO injected animals (n = 15-20)
were collected at 28 hpf and homogenized in lysis buffer, boiled at 85 °C for 5 min,
spun down and the supernatant collected. A Bradford assay was performed to
assess protein concentration. 10-15 μg of protein was run out on a 3-8% Tris-
Acetate gel at 150 V for 1 h. Detection was performed with an anti-Piezo1 antibody
(Proteintech catalogue no. 15939-1-AP) using standard ECL (enhanced chemiluminescence)
methods. HC11, a mammary epithelial cell line, was used as a
control for protein levels in mammalian tissues. Piezo1 protein levels are presented
relative to the α-tubulin loading control. Quantification of the scanned blot was
performed using Image J. 
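
Expressing a band intensity relative to a loading control amounts to a per-lane ratio. The following Python sketch illustrates that arithmetic only; it is not the authors' analysis script. The function names are hypothetical, the input values are assumed to be background-subtracted band intensities exported from the image analysis software, and the optional second step (scaling to a reference lane) is an assumption that the text does not describe.

```python
def relative_to_loading_control(target, loading):
    """Divide each target band intensity by the loading-control band of the same lane."""
    if len(target) != len(loading):
        raise ValueError("target and loading control must have one value per lane")
    if any(value <= 0 for value in loading):
        raise ValueError("loading-control intensities must be positive")
    return [t / l for t, l in zip(target, loading)]


def fold_of_reference(ratios, reference_lane=0):
    """Optional step (an assumption): scale ratios so that the reference lane equals 1."""
    reference = ratios[reference_lane]
    if reference == 0:
        raise ValueError("reference lane ratio is zero")
    return [ratio / reference for ratio in ratios]
```

For example, made-up Piezo1 intensities of 200 and 50 with α-tubulin intensities of 100 in both lanes give ratios of 2.0 and 0.5, or 1.0 and 0.25 when scaled to the first lane.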

Kaede expression in the zebrafish embryonic epidermis. To generate the 
Et(Gal4-VP16;myl7:GFP)'"" enhancer-trap line, a plasmid along with tol2 
mRNA was injected into 1-cell-stage developing zebrafish embryos. The potential 
founders were then crossed to Tg(UAS-E1 b:Kaede)*!°*' (ref. 27) and subsequently 
identified by Kaede expression. The identified F1 transgenic embryos were then 
Osteoprotection by semaphorin 3A 


Mikihito Hayashi, Tomoki Nakashima, Masahiko Taniguchi, Tatsuhiko Kodama, Atsushi Kumanogoh 
& Hiroshi Takayanagi 


The bony skeleton is maintained by local factors that regulate bone-forming osteoblasts and bone-resorbing osteoclasts, 
in addition to hormonal activity. Osteoprotegerin protects bone by inhibiting osteoclastic bone resorption, but no factor 
has yet been identified as a local determinant of bone mass that regulates both osteoclasts and osteoblasts. Here we show 
that semaphorin 3A (Sema3A) exerts an osteoprotective effect by both suppressing osteoclastic bone resorption and 
increasing osteoblastic bone formation. The binding of Sema3A to neuropilin-1 (Nrp1) inhibited receptor activator of 
nuclear factor-κB ligand (RANKL)-induced osteoclast differentiation by inhibiting the immunoreceptor tyrosine-based 
activation motif (ITAM) and RhoA signalling pathways. In addition, Sema3A and Nrp1 binding stimulated osteoblast and 
inhibited adipocyte differentiation through the canonical Wnt/β-catenin signalling pathway. The osteopenic phenotype 
in Sema3a−/− mice was recapitulated by mice in which the Sema3A-binding site of Nrp1 had been genetically disrupted. 
Intravenous Sema3A administration in mice increased bone volume and expedited bone regeneration. Thus, Sema3A is a 
promising new therapeutic agent in bone and joint diseases. 


Bone homeostasis has long been thought to be predominantly maintained 
by the endocrine system through calcium-regulating hormones, but 
increasing evidence indicates that bone is also under the control of 
factors related to immune and neuronal regulation. As an imbalance 
between bone resorption and formation results in metabolic bone 
disorders such as osteoporosis, understanding the balancing 
mechanisms is important for the development of therapeutic agents. 
Because bone formation is linked to resorption through coupling 
factors, treatment with anti-resorptive agents results in simultaneous 
suppression of bone formation, with the result that the efficacy is 
compromised. It is thus crucial to identify a molecule that can 
regulate both resorption and formation synchronously. 

Osteoclasts are derived from monocyte/macrophage precursor 
cells, and their differentiation is regulated by mesenchymal cells, such 
as osteoblasts, chondrocytes and osteocytes, which express the key 
osteoclast differentiation factor RANKL. Osteoblastic cells 
counterbalance the function of RANKL by producing a soluble decoy 
receptor for RANKL, osteoprotegerin (Opg), the name of which 
indicates it is a protector of bone. Here we show that conditioned 
medium from Opg-deficient mouse calvarial cells contains factors 
that inhibit osteoclast formation, and one of these factors is the axon 
guidance molecule Sema3A. Sema3A inhibits osteoclast differentiation 
and promotes osteoblastic bone formation, and is thus a potent 
osteoprotective factor produced by osteoblastic cells. 


Sema3A mediates anti-osteoclastogenesis in osteoblasts 


The conditioned medium of osteoblastic cells was able to inhibit 
osteoclast differentiation of bone marrow-derived monocyte/macro- 
phage precursor cells (BMMs) stimulated by RANKL in the presence 
of macrophage colony-stimulating factor (M-CSF) (Fig. 1a). We 
observed a substantial anti-osteoclastogenic effect in the conditioned 
medium of calvarial cells lacking the Tnfrsf11b gene (encoding Opg in 
mice) (Fig. 1a), suggesting the presence of one or more other soluble 
inhibitory factors. To identify the osteoblast-secreted proteins that 
inhibit osteoclast differentiation, we fractionated the conditioned 
medium of Tnfrsf11b−/− calvarial cells by anion-exchange liquid 
chromatography (Fig. 1b). We found that fractions 8-10 exerted a 
potent inhibitory effect on osteoclast differentiation (Fig. 1b). The 
proteins in fraction 8 were separated by SDS-polyacrylamide gel 
electrophoresis (SDS-PAGE; Fig. 1c), and the major bands were 
excised and analysed by liquid chromatography-tandem mass spectrometry 
(LC-MS/MS). Among the identified proteins (Fig. 1d), we 
focused on the axon guidance molecule Sema3A (Supplementary 
Fig. 1a), as recent studies have suggested that axon guidance 
molecules are involved in the interaction between osteoblasts and 
osteoclasts. Previous reports suggest that Sema3A expressed in 
the skeletal system may have a role in the regulation of innervation 
and blood vessel invasion and contribute to skeletal patterning, 
but the function of Sema3A in the regulation of bone remodelling 
remains unknown. 

Western blot analysis confirmed that Sema3A protein was present 
in fractions 8 and 9 (Supplementary Fig. 1b), and that the inhibitory 
effect of fraction 9 is largely mediated by Sema3A, as this effect was 
abrogated by the addition of soluble Nrp1, which functioned as a 
Sema3A decoy receptor (Supplementary Fig. 1c). The addition of 
recombinant Sema3A potently inhibited osteoclast differentiation in 
a dose-dependent manner when Sema3A was added before RANKL 
treatment (Fig. 1e). When Sema3A was added after RANKL treatment, 
the inhibitory effect was not observed (Fig. 1e). Sema3a was 
predominantly expressed in osteoblast lineage cells among various 
cells examined (Supplementary Fig. 2a), whereas Sema3A was not 
detected in osteoclasts (Supplementary Fig. 2a-c). The expression 
of Sema3a messenger RNA in isolated osteocytes and osteoblasts 
was comparable (Supplementary Fig. 2d). Sema3a mRNA expression 
in calvarial cells was higher than that of any other semaphorin family 
member tested (Supplementary Fig. 2e). 
Figure 1 | Identification of Sema3A as an inhibitory factor of osteoclast 
differentiation. a, Effect of wild-type (WT) and Opg-deficient (Tnfrsf11b−/−) 
calvarial cell-conditioned medium on osteoclast differentiation. MNC, 
mononuclear cells. b, Fractionation of Opg-deficient calvarial cell-conditioned 
medium by anion-exchange chromatography, and the effect of each fraction on 
osteoclast differentiation (black bars). Absorbance at 280 nm is indicated as a red 
line and the concentration of NaCl as a blue line. Inset shows TRAP staining of 
osteoclast cultures treated with fractions 3, 9 and 15. c, Coomassie brilliant blue- 
stained SDS-PAGE image of fraction 8. M, molecular mass. d, List of the 
identified proteins (that is, more than 300 on the MASCOT score), their 
theoretical molecular mass, MASCOT score and the number of non-redundant 
peptides. e, Effect of Sema3A treatment on osteoclast differentiation. Error bars 
(a, b and e) denote mean ± s.e.m. **P < 0.01; ***P < 0.005; NS, not significant. 
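
The error bars in the figures report the mean and the standard error of the mean (s.e.m.). As a reminder of the arithmetic, the s.e.m. is the sample standard deviation divided by the square root of the number of replicates. The Python sketch below illustrates this with a hypothetical helper; it assumes the sample (n − 1) standard deviation and is not taken from the authors' analysis.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, s.e.m.) for a list of replicate measurements.

    The s.e.m. is computed as the sample standard deviation (n - 1 denominator)
    divided by sqrt(n); at least two replicates are required.
    """
    n = len(values)
    if n < 2:
        raise ValueError("at least two replicates are needed to estimate the s.e.m.")
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)
```

With small group sizes such as the n = 4-7 reported in the figure legends, the s.e.m. is a fairly coarse estimate of the uncertainty in the mean.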


Sema3A regulates osteoclast differentiation via Nrp1 
Microcomputed tomography along with bone morphometric and 
radiographic analyses showed that Sema3a−/− mice have a severe 
osteopenic phenotype in both trabecular and cortical bones, accompanied 
by an increase in the osteoclast number and eroded surface 
(Fig. 2a, b and Supplementary Fig. 3a-c). There was no difference in 
the number of osteoclast precursor CD11blo/− CD3ε− B220− c-fms+ 
c-kit+ cells in the bone marrow of wild-type and Sema3a−/− 
mice (Supplementary Fig. 3d). 

When osteoclast formation was analysed in a coculture of bone 
marrow and calvarial cells, the formation of tartrate-resistant acid 
phosphatase (TRAP)-positive multinucleated osteoclasts was markedly 
enhanced in Sema3a−/− cells (Fig. 2c and Supplementary Fig. 3e, f). 
This enhanced osteoclastogenesis was not observed when BMMs were 
stimulated by RANKL and M-CSF (Supplementary Fig. 3g). In addition, 
enhanced osteoclastogenesis was observed in the coculture of 
wild-type bone marrow cells and Sema3a−/− calvarial cells, but not 
in the coculture of Sema3a−/− bone marrow cells and wild-type calvarial 
cells (Fig. 2c and Supplementary Fig. 3e, f). These results indicate that 
the osteoblastic expression of Sema3A inhibits osteoclastogenesis. The 
level of RANKL and Opg in the calvarial cells or the serum was not 
affected by Sema3A deficiency (Supplementary Fig. 3h, i). 

Sema3A binds to a receptor complex of the ligand-binding subunit 
Nrp1 and one of the class A plexins (PlxnA1, PlxnA2, PlxnA3 and 
PlxnA4), which function as the signal-transducing subunit. We 
found that Nrp1 expression in BMMs was rapidly and markedly 
suppressed after RANKL stimulation (Fig. 2d). Because Sema3A-induced 
inhibition of osteoclastogenesis was observed only when 
Sema3A was added before RANKL stimulation (Fig. 1e), we proposed 
that Sema3A does not inhibit osteoclastogenesis after RANKL stimulation 
owing to Nrp1 downregulation. 

When Nrp1 was overexpressed by retroviral transfer, Sema3A 
exerted an inhibitory effect even when Sema3A had been added after 
RANKL stimulation (Supplementary Fig. 3j). Notably, osteoclasto- 
genesis was inhibited by Nrp1 overexpression only (Supplementary 
Fig. 3j). When Nrp1 expression was knocked down by short hairpin 
RNA (shRNA), the inhibitory effect of Sema3A on osteoclast differ- 
entiation was abolished (Supplementary Fig. 3k). Thus, the level of 
Nrp1 correlates with the inhibitory effect of Sema3A on osteoclastogenesis, 
suggesting that the Nrp1 downregulation caused by RANKL 
signalling is important for proper osteoclast differentiation by cancelling 
the inhibitory effect of Sema3A. 

Furthermore, we analysed knockin mice in which the Nrp1 gene 
was replaced by mutant Nrp1 lacking the Sema-binding site 
(Nrp1Sema− mice), as Nrp1−/− mice are embryonically lethal and 
Nrp1 also contains the vascular endothelial growth factor (VEGF)-binding 
site. Sema3A did not inhibit RANKL-induced osteoclast 
differentiation in Nrp1Sema− cells (Supplementary Fig. 3l), showing 
that Sema3A inhibits osteoclastogenesis by binding to Nrp1. As 
expected, Nrp1Sema− mice showed an osteopenic phenotype accompanied 
by enhanced osteoclast differentiation, which was similar to that of 
Sema3a−/− mice (Fig. 2e, f and Supplementary Fig. 3m-o). 

To understand the mechanism of the RANKL-induced inhibition 
of Nrp1 expression, we examined the involvement of the transcription 
factors nuclear factor-κB (NF-κB), c-Fos and nuclear factor of activated 
T cells c1 (NFATc1), which are all activated by RANKL. 
RANKL-induced downregulation of Nrp1 expression was abolished 
by an NF-κB inhibitor, but was not affected by the deficiency of 
NFATc1 or c-Fos (Supplementary Fig. 4a-c). Chromatin immunoprecipitation 
analysis showed that NF-κB p65 and, to a lesser extent, 
p50 were recruited to the proximal NF-κB-binding site of the Nrp1 
promoter after RANKL stimulation (Supplementary Fig. 4d, e). NF-κB 
p65 and p50 inhibited Nrp1 promoter activity in a reporter gene 
assay (Supplementary Fig. 4f). Retroviral overexpression of p65 in 
BMMs led to Nrp1 downregulation in the absence of RANKL, which 
was further facilitated by p50 overexpression (Supplementary Fig. 4g). 
These effects were dependent on histone deacetylases (Supplementary 
Fig. 4g, h), suggesting that the recruitment of corepressors by 
RANKL-stimulated NF-κB is involved in Nrp1 downregulation. 


Mechanism of anti-osteoclastogenesis by Sema3A 


The binding of RANKL to its receptor RANK results in the activation 
of tumour-necrosis factor (TNF) receptor-associated factor 6 
(TRAF6), which stimulates the NF-κB and mitogen-activated protein 
kinase (MAPK) pathways. RANKL also activates the activator protein 1 
(AP-1) transcription factor complex, including c-Fos, which 
cooperates with NF-κB to induce NFATc1, thus activating the transcription 
of osteoclast-specific genes. The robust induction of 
NFATc1 is dependent on calcium signalling (costimulatory signalling) 
stimulated by the ITAM-bearing adaptor molecules DNAX-activation 
protein 12 (DAP12) and Fc receptor common γ subunit (FcRγ), which 
associate with immunoglobulin-like receptors such as triggering 
receptor expressed on myeloid cells 2 (TREM2) and osteoclast-associated 
receptor (OSCAR). 

From these observations, the question arises as to how the 
Sema3A-Nrp1 axis inhibits osteoclastogenic signalling. RANKL-stimulated 
induction of the osteoclastic genes Ctsk, Acp5 and Nfatc1 
was severely impaired by Sema3A without affecting the expression of 
Tnfrsf11a or Csf1r (Fig. 3a). There was no difference in the cell 
proliferation rate or the percentage of apoptotic cells among the 
osteoclast precursor cells between the Sema3A-treated and the 
control cells (Supplementary Fig. 5a, b). RANKL-induced activation 
of the signalling pathways downstream of TRAF6, including the 
MAPKs (such as ERK, JNK, p38) and inhibitor of κB (IκB) kinases, 
was comparable in BMMs with or without Sema3A (Supplementary 
Fig. 5c). 

PlxnA1 promotes osteoclast differentiation by activating the ITAM 
signal through the formation of the PlxnA1-TREM2-DAP12 complex 
in response to ligands such as Sema6D. However, PlxnA1 is constitutively 
associated with Nrp1, which mediates Sema3A signalling 
Figure 3 | Inhibition of osteoclast differentiation by Sema3A-Nrp1 
signalling. a, Effect of Sema3A treatment on osteoclastic gene expression in 
BMMs treated with RANKL for 2 days. b, Effect of Nrp1 expression on the 
association of PlxnA1 with TREM2. IP, immunoprecipitation. c, Effect of 
Sema3A treatment on the formation of a complex of PlxnA1 with Nrp1 or 
TREM2/DAP12 in RANKL-treated BMMs. d, Transwell assay of the effect of 
Sema3A on M-CSF-induced migration of BMMs derived from wild-type or 
Nrp1Sema− mice. e, Effect of Sema3A treatment on the activation of RhoA in 
BMMs stimulated with M-CSF. Error bars (a, d and e) denote mean ± s.e.m. 
*P < 0.05; **P < 0.01; ***P < 0.005. 


(n = 4-5). Error bars (a-c, e and f) denote 
mean ± s.e.m. **P < 0.01; ***P < 0.005. 


instead of TREM2-DAP12 signalling. With increasing Nrp1 
expression, the amount of TREM2 associated with PlxnA1 decreased 
and the amount of Nrp1 associated with PlxnA1 increased (Fig. 3b). 

RANKL induced the formation of the PlxnA1-TREM2-DAP12 
complex by the downregulation of Nrp1, thereby releasing PlxnA1 
from the PlxnA1-Nrp1 complex (Fig. 3c). Sema3A treatment 
inhibited RANKL-induced formation of the PlxnA1-TREM2-DAP12 
complex by inhibiting Nrp1 downregulation and maintaining 
the PlxnA1-Nrp1 complex (Fig. 3c). The cell surface and 
intracellular expression of Nrp1 was highly downregulated by 
RANKL treatment (Supplementary Fig. 5d), but Sema3A treatment 
induced the internalization of Nrp1, as already reported, and 
protected against RANKL-induced Nrp1 downregulation without altering 
Nrp1 mRNA expression (Supplementary Fig. 5d, e). 

RANKL-induced tyrosine phosphorylation of phospholipase Cγ2 
(PLCγ2) and calcium oscillation were both markedly blocked by 
Sema3A treatment (Supplementary Fig. 5f, g). We observed that 
osteoclast differentiation in DAP12-deficient (also known as Tyrobp-deficient) 
bone marrow cells was not enhanced even in a coculture 
with Sema3a−/− calvarial cells (Supplementary Fig. 5h), suggesting 
that Sema3A-induced inhibition is mediated by the modulation of 
DAP12-induced ITAM signalling. Thus, Nrp1 competes with 
TREM2 for PlxnA1, thereby functioning as a suppressor of the 
PlxnA1-TREM2-DAP12-induced costimulatory signal. Sema3A-induced 
inhibition of osteoclast differentiation was less pronounced in 
the presence of Sema6D (Supplementary Fig. 5i). 

We examined the effect of Sema3A on the migration of BMMs 
because the semaphorin-plexin system regulates actin cytoskeletal 
rearrangement. We observed a repulsive effect of Sema3A on 
M-CSF-induced migration of BMMs (Fig. 3d). By contrast, this 
repulsive effect was not observed in Nrp1Sema− BMMs (Fig. 3d). 
Because semaphorin-plexin signalling regulates the Rho family of 
small GTPases, we examined the effect of Sema3A treatment on 
M-CSF-induced activation of the RhoA and Rac GTPases. Sema3A 
treatment abrogated RhoA activation in response to M-CSF (Fig. 3e), 
but not Rac activation (Supplementary Fig. 5j), suggesting that the 
inhibition of RhoA activation is involved in the inhibitory effect of 
Sema3A on the migration of BMMs. 


Sema3A regulates osteoblasts through the Wnt pathway 


In addition to an osteoclastic phenotype, both Sema3a−/− and 
Nrp1Sema− mice were found to have osteoblastic and adipocytic 
phenotypes (Fig. 4a and Supplementary Fig. 6a-c); that is, they had a 
decreased osteoblast number, a reduced bone formation rate and a 
markedly increased adipocyte number (Fig. 4b and Supplementary 
Fig. 6d, e) without any significant difference in the weight of the 
epididymal white adipose tissue per body weight (Supplementary Fig. 
6f, g). Taken together, these findings indicate that the severe osteopenic 
phenotype in Sema3a−/− and Nrp1Sema− mice was caused by both a decrease 
in osteoblastic bone formation and an increase in osteoclastic bone resorption. 

Calvarial cells obtained from Sema3a−/− or Nrp1Sema− mice were 
cultured in an osteogenic medium with or without Sema3A. Alkaline 
phosphatase activity and bone nodule formation were markedly 
decreased in Sema3a−/− and Nrp1Sema− cells, and Sema3A treatment 
facilitated the differentiation of Sema3a−/− calvarial cells into osteoblastic 
cells, but not Nrp1Sema− cells (Fig. 4c, d and Supplementary 
Fig. 6h-k). Neither the cell proliferation rate nor the percentage of 
apoptotic cells was affected in either type of mutant cell (Supplementary 
Fig. 6l, m). Adipocyte differentiation was highly increased 
in both Sema3a−/− and Nrp1Sema− cells, and Sema3A treatment 
blocked the differentiation of wild-type and Sema3a−/− cells into 
adipocytes, but not Nrp1Sema− cells (Fig. 4e and Supplementary Fig. 
6n, o). In Sema3a−/− cells, the expression of the osteoblast genes 
Runx2, Sp7 (which encodes osterix), Alpl and Bglap (encoding osteocalcin) 
was strongly suppressed (Supplementary Fig. 7a), and the 
expression of the adipocyte genes Pparg, Cebpa, Fabp4 (encoding 
aP2) and Lpl (encoding lipoprotein lipase) was highly increased 
(Supplementary Fig. 7b). These results indicate that Sema3A activates 
osteoblast differentiation and inhibits adipocyte differentiation 
through Nrp1. 

Because the mRNA expression levels of the known regulators of 
mesenchymal cell differentiation were comparable in wild-type 
and Sema3a−/− cells (Supplementary Fig. 7c), we performed gene 
Figure 4 | Impaired osteoblast differentiation and increased adipocyte 
differentiation in Sema3a−/− and Nrp1Sema− mice. a, Toluidine blue staining 
of the proximal tibiae of wild-type, Sema3a−/− and Nrp1Sema− mice (left). New 
bone formation was determined by calcein double labelling (right). 
b, Osteoblastic and adipocytic parameters measured by histomorphometric 
analysis of wild-type, Sema3a−/− and Nrp1Sema− mice (n = 4-6). c, Alkaline 
phosphatase (ALP) staining of wild-type and Sema3a−/− calvarial cells cultured 
in osteogenic medium with or without Sema3A. d, Bone nodule formation in 
wild-type and Sema3a−/− calvarial cells cultured in osteogenic medium with or 
without Sema3A. e, Adipocyte differentiation in wild-type and Sema3a−/− bone 
marrow stromal cells cultured in adipogenic medium with or without Sema3A. 
Error bars (b-e) denote mean ± s.e.m. *P < 0.05; **P < 0.01; ***P < 0.005. 
expression profiling of the calvarial cells derived from Sema3a−/− 
mice to obtain insight into the Sema3A-activated molecular pathways 
in osteoblasts. Gene set enrichment analysis in Sema3a−/− cells 
showed a significant downregulation of the gene sets involved in 
the Wnt signalling pathway and the Wnt-related signalling pathways 
(Supplementary Fig. 7d and Supplementary Table 1). We therefore 
focused on the canonical Wnt pathway, as it is known to promote 
osteoblast differentiation and inhibit adipocyte differentiation. 
The mRNA expression of most of the transcriptional targets of 
β-catenin was considerably reduced (Supplementary Fig. 7e), and 
the Wnt3a-induced nuclear accumulation of β-catenin was suppressed 
in Sema3a−/− calvarial cells (Fig. 5a). 

The Sema3A signalling pathway induces activation of the small G 
protein Rac1 through FARP2 (FERM, RhoGEF and pleckstrin 
domain protein 2), which is a guanyl-nucleotide exchange factor 
(GEF) specific for Rac1 (ref. 34). Previous studies suggested a regulatory 
role of FARP2 in transduction of semaphorin-induced repulsive cues 
in axons, via Rac1 activation. Because Rac1 promotes the nuclear 
localization of β-catenin in response to canonical Wnt ligands, we 
examined the activation of Rac proteins and RhoA in Sema3a−/− cells. 
The activation of Rac, but not RhoA, in Sema3a−/− calvarial cells in 
response to Wnt3a treatment was significantly decreased (Fig. 5b and 
Supplementary Fig. 7f) as compared with that in wild-type control 
cells. In addition, Sema3A treatment facilitated the nuclear translocation 
of β-catenin and the activation of Rac in Sema3a−/− cells 
(Fig. 5a, b). The ectopic expression of a dominant negative form of 
FARP2 (ΔGEF-FARP2) in calvarial cells resulted in the inhibition of 
osteoblast differentiation, even in the presence of Sema3A (Fig. 5c and 
Supplementary Fig. 7g). When ΔGEF-FARP2 was overexpressed, 
reduced nuclear accumulation of β-catenin was observed and 
Sema3A treatment had no effect on its nuclear localization (Fig. 5d). 
These results indicate that Sema3A stimulates the canonical Wnt/β-catenin 
signalling pathway, at least in part, through FARP2-mediated 
activation of Rac1 during osteoblast differentiation. 


Sema3A as an osteoprotective therapeutic agent 


To determine the in vivo effect of Sema3A administration on bone 
metabolism, 5-week-old male mice were intravenously injected with 
recombinant Sema3A or saline once a week. After four weeks of treat- 
ment, the trabecular bone volume and trabecular parameters in the 
distal femur were increased in the Sema3A-treated mice (Fig. 6a and 
Supplementary Fig. 8a). Bone morphometric analysis showed a 
decrease in osteoclastic parameters and an increase in osteoblastic para- 
meters (Fig. 6b, c and Supplementary Fig. 8b-e), suggesting that 
Sema3A exerts a bone-increasing effect by stimulating osteoblastic bone 
formation and inhibiting osteoclastic bone resorption synchronously. 
We could not detect any pathological findings in vital organs or any 
behavioural abnormalities after the Sema3A injection (data not shown). 
The number of osteoclast precursor cells and osteoprogenitor cells in 
the bone marrow was not influenced by Sema3A administration 
(Supplementary Fig. 8f and data not shown), but bone marrow 
mesenchymal cells derived from Sema3A-treated mice tended to 
differentiate into osteoblasts instead of adipocytes in vitro, although 
the number of colony forming units was unchanged (Supplementary 
Fig. 8g, h). 

We further investigated the therapeutic potential of Sema3A in a 
bone regeneration model of cortical bone defects induced by drill 
hole injury. Microcomputed tomography analysis showed that the 
regenerated cortical bone volume in Sema3A-treated mice was higher 
than in saline-treated mice (Fig. 6d and Supplementary Fig. 8i). The 
significantly increased osteoblast surface and decreased osteoclast sur- 
face around the injured region were observed by histomorphometric 
analysis (Fig. 6e). These results indicate that the local administration of 
Sema3A into the injured site accelerates bone regeneration, although 
we cannot rule out the possibility that Sema3A exerted a bone protec- 
tive effect partly through the regulation of innervation. 


©2012 Macmillan Publishers Limited. All rights reserved 


[Figure 5 image. Panels a and d: immunoblots of β-catenin, α-tubulin and histone H1 in nuclear and cytosolic fractions of cells treated with Wnt3a and Sema3A. Panels b and c: bar graphs for WT and Sema3a−/− cells (control, 30 and 60 min of Wnt3a) and for pMX-mock and pMX-ΔGEF-FARP2 cells (control and Sema3A).] 


Figure 5 | Regulation of osteoblast differentiation by Sema3A through 
canonical Wnt signalling. a, Analysis of nuclear β-catenin levels in the wild-type 
and Sema3a−/− calvarial cells stimulated by Wnt3a in the presence or 
absence of Sema3A. Sema3A was added simultaneously with Wnt3a. b, Effect 
of Sema3A treatment on the activation of Rac in the wild-type and Sema3a−/− 
calvarial cells treated with Wnt3a. c, Effect of retrovirus-mediated 
overexpression of ΔGEF-FARP2 on bone nodule formation in calvarial cells 
cultured in the osteogenic medium in the presence or absence of Sema3A. 
d, Effect of retrovirus-mediated overexpression of ΔGEF-FARP2 on the nuclear 
localization of β-catenin in calvarial cells treated with Wnt3a in the presence or 
absence of Sema3A. Error bars (b, c) denote mean ± s.e.m. ***P < 0.005. 


We examined the effect of Sema3A administration on bone loss in 
an ovariectomized mouse model of postmenopausal osteoporosis. 
Ovariectomized 9-week-old mice were treated with a weekly intra- 
venous injection of Sema3A starting two days after ovariectomy and 
continuing for four weeks. Sema3A administration decreased bone 
loss after ovariectomy both by inhibiting osteoclastic bone resorption 
and promoting osteoblastic bone formation (Fig. 6f, g and Sup- 
plementary Fig. 8j-l). Recombinant human SEMA3A suppressed 
osteoclastogenesis and promoted osteoblastogenesis in cultured 
human cells (Supplementary Fig. 9a, b). These results indicate that 
Sema3A is a promising potential therapeutic target for bone diseases. 


Conclusions 

This study demonstrates that the Sema3A expressed by osteoblast 
lineage cells functions as a potent osteoprotective factor by synchro- 
nously inhibiting bone resorption and promoting bone formation 
(Supplementary Fig. 10a-c). Sema3A represents the long sought 
soluble molecule with the capacity to bring both osteoblasts and 
osteoclasts into a condition that favours bone mineral increase. 
Bone remodelling consists of resorption, transition and formation 
phases, and the transition phase is under the control of classical coupling 
factors such as insulin-like growth factor and transforming growth 
factor-β, which link bone resorption with formation. Sema3A may 
have a crucial role in the bone formation phase, in which osteoblasts 
extensively produce bone, and at the same time restrain osteoclasts from 


[Figure 6 image. Panels a-g compare control and Sema3A-treated mice, and Sham, OVX and OVX + Sema3A groups. Legible axis labels include bone volume per tissue volume (%), eroded surface per bone surface (%) and bone formation rate per bone surface.] 


Figure 6 | Sema3A as a potential bone-increasing agent. a, Microcomputed 
tomography analysis of the femurs of 9-week-old wild-type mice treated with 
Sema3A or saline control (n = 4-7). b, Histological analysis of the proximal 
tibiae of wild-type mice treated with Sema3A or saline control. c, Parameters for 
osteoclasts and osteoblasts in the bone morphometric analysis of wild-type 
mice treated with Sema3A or saline control (n = 4-7). d, Microcomputed 
tomography analysis of bone regeneration of femoral cortex after drill-hole 
injury (n = 5). e, Histomorphometric analysis of the injured site of the femur 
(TRAP and haematoxylin staining; n = 5). CB, cortical bone. f, Microcomputed 
tomography analysis of the femurs of the sham-operated (Sham), 
ovariectomized (OVX) and Sema3A-treated OVX mice (n = 4-5). 
g, Parameters for osteoclastic bone resorption and osteoblastic bone formation 
in the bone morphometric analysis of the Sham, OVX and Sema3A-treated 
OVX mice (n = 4-5). Error bars (a, c-g) denote mean ± s.e.m. *P < 0.05; 
**P < 0.01; ***P < 0.005. 


migrating to the formation sites and starting to resorb the newly formed 
bone. The Sema3A protein level in the serum or bone microenviron- 
ment could be an auspicious biomarker for bone turnover, as we 
observed that the serum level of Sema3A decreased with age in mice 
(data not shown). 

The potent anti-osteoclastogenic function of Sema3A is tightly 
controlled by Nrp1 expression regulated by RANKL signalling. 
Unless Nrp1 is downregulated by RANKL, the Sema3A-Nrp1 axis 
inhibits osteoclast differentiation by sequestering PlxnA1 from 
TREM2 so as to suppress ITAM signalling, and also inhibits RhoA 
activation via Nrp1-PlxnA to suppress osteoclast precursor cell 
migration (Supplementary Fig. 10b). After RANKL reduces Nrp1 
expression, PlxnA1 associates with TREM2 and DAP12, which facilitate 
the ITAM-mediated calcium signalling required for osteoclast differentiation 
(Supplementary Fig. 10b). Because PlxnA1, PlxnA2 and 
PlxnA3, but not PlxnA4, are expressed by osteoclast and osteoblast 
lineage cells (data not shown), the relative contribution of these receptor 
components should be explored in the future. 

Therapeutic agents capable of increasing bone formation have 
essentially been unavailable except for parathyroid hormone or 
anti-sclerostin antibody. This study may provide a molecular basis 
for the development of a combined anti-resorptive and bone-increasing 
agent capable of facilitating bone regeneration. 


METHODS SUMMARY 


Mice and bone analysis. The generation of Sema3a ‘~, NrpI*°™*”, Fos‘, 
Nfatcl '~, Tnfrsfl1b ‘~ and Tyrobp ‘~ mice was described previously’?! 
All mice were maintained under specific pathogen-free conditions. All animal 
experiments were approved by the Institutional Animal Care and Use 
Committee of Tokyo Medical and Dental University. Three-dimensional micro- 
computed tomography analyses and bone morphometric analyses were performed 
as described. The radiographs were obtained with a high-resolution soft 
X-ray system (SOFTEX). 

Quantitative RT-PCR analysis and GeneChip analysis. Real-time quantitative 
PCR with reverse transcription (RT-PCR) analysis was performed as 
described. In brief, total RNA was extracted with ISOGEN (NIPPON 
GENE) according to the manufacturer’s instructions. First-strand complementary 
DNAs were synthesized using Superscript III reverse transcriptase (Invitrogen). 
Quantitative RT-PCR analysis was performed with the LightCycler apparatus 
(Roche Applied Science) using SYBR Green Realtime PCR Master Mix 
(TOYOBO). All primer sequences are available on request. GeneChip analysis 
and gene set enrichment analysis were performed as described previously. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Cell culture. For in vitro osteoclast differentiation in the monoculture system, 
primary bone marrow cells (1 X 10° cells per cm*) were suspended in culture 
medium (α-MEM containing penicillin, streptomycin and 10% FBS) supplemented 
with 10 ng ml⁻¹ M-CSF (R&D Systems) for two days to obtain BMMs. The 
resultant BMMs were further cultured in medium supplemented with 10 ng ml⁻¹ 
M-CSF and 5-50 ng ml⁻¹ RANKL (PeproTech) for three days. Culture medium 
was changed every second day. Where indicated, calvarial cell-conditioned 
medium with or without soluble Nrp1 (R&D Systems) was added 12h before 
the RANKL stimulation. Sema3A-Fc (R&D Systems) was added 12h before or 
12h after the RANKL stimulation. For Sema6D treatment, BMMs were collected 
and seeded onto culture plates coated with soluble recombinant Sema6D (R&D 
Systems), and cultured with 10 ng ml⁻¹ M-CSF and 5 ng ml⁻¹ RANKL for three 
days. For the generation of osteoclasts in vitro in the coculture system, primary bone 
marrow cells (5 X 10* cells per cm?) and calvarial cells (5 X 10° cells per cm”) were 
cultured in the presence of 10 nM 1α,25-dihydroxyvitamin D3 and 1 μM 
prostaglandin E2 for 4-6 days. For human osteoclast differentiation, human 
peripheral blood mononuclear cells were separated from peripheral blood 
obtained from healthy volunteers by density gradient centrifugation with 
Lymphoprep (AXIS-SHIELD). Cells (2 X 10° cells per 0.5 ml) were cultured in 
α-MEM with 10% FBS supplemented with 30 ng ml⁻¹ M-CSF for two days. The resultant 
preosteoclasts were further cultured in medium supplemented with 30 ng ml⁻¹ 
M-CSF and 60 ng ml⁻¹ RANKL for four days. Culture medium was changed every 
second day. The differentiation of osteoclasts was evaluated by TRAP staining. An 
NF-κB activation inhibitor (6-amino-4-(phenoxyphenylethylamino)quinazoline; 
Calbiochem) was used to inhibit NF-κB activity. The concentration of intracellular 
calcium was measured as described. For in vitro osteoblast differentiation, calvarial 
cells were isolated from the calvarial bone of newborn mice by enzymatic digestion 
in α-MEM with 0.1% collagenase and 0.2% dispase, and were cultured with α-MEM 
with 10% FBS. After two days, cells were reseeded (1  10* cells per cm’) and cultured 
with osteogenic medium (100 mM ascorbic acid, 5 mM β-glycerophosphate and 
10 nM dexamethasone). Culture medium was changed every third day. After seven 
days, ALP staining and activity measurement were performed, and after 21 days, 
bone nodule formation was assessed by alizarin red staining. Human mesenchymal 
stem cells (Lonza) were cultured according to the manufacturer’s protocol. To induce 
adipocyte differentiation in vitro, primary bone marrow cells (5 X 10° cells per cm”) 
were cultured with α-MEM containing 10% FBS. After 24 h, non-adherent cells were 
removed and adherent cells were cultured with adipogenic medium (0.5 mM 
3-isobutyl-1-methylxanthine, 5 μg ml⁻¹ insulin and 1 μM dexamethasone) for 
10-14 days. Culture medium was changed every second day. Lipid accumulation 
in adipocytes was determined with Oil Red O staining. Cell proliferation was deter- 
mined using a cell proliferation ELISA kit (Roche Applied Science). The percentage 
of apoptotic cells was determined by TUNEL (TdT-mediated dUTP nick end 
labelling) staining with the MEBSTAIN apoptosis kit direct (MBL). For colony 
forming unit (CFU) assays, primary bone marrow cells (2.5 X 10° cells per cm”) 
were seeded and cultured with MesenCult basal medium supplemented with 
mesenchymal stem cell stimulatory supplements (StemCell Technologies). On 
day 10, the cells were stained with toluidine blue. For CFU-ALP and CFU-osteoblast 
(CFU-OB) assays, primary bone marrow cells (2.5 X 10° cells per cm”) were seeded 
and cultured with MesenCult basal medium and mesenchymal stem cell stimulatory 
supplements plus 100 mM ascorbic acid, 5 mM β-glycerophosphate and 10 nM 
dexamethasone. On day 10, CFU-ALP colonies were stained for ALP. Mineral 
deposition was determined with von Kossa staining of CFU-OB colonies on day 25. 
Primary osteoblasts and osteocytes were isolated from the long bones of 
CAG-CAT-EGFP/Dmp1-Cre double transgenic mice as described. 

Purification and identification of inhibitory factor of osteoclast differentiation. 
Mouse calvarial cells were statically cultured with α-MEM supplemented 
with 1% FBS. Medium was conditioned for 72h and filtered to remove non- 
adherent cells and debris. Conditioned medium was concentrated by ammonium 
sulphate precipitation (40% saturation), and the pellet was suspended in 10 mM 
sodium phosphate buffer, pH 7.4, and desalted using a PD-10 column (GE 
Healthcare). Concentrated conditioned media were then loaded onto a Mono 
Q 5/50 column (GE Healthcare) in 10 mM sodium phosphate buffer, pH 7.4, and 
proteins binding to the Mono Q matrix were eluted by a gradient of 0-100% 1M 
NaCl and 10 mM sodium phosphate buffer, pH 7.4. Protein concentrations were 
determined by the absorbance at 280 nm. The effect of the fractionated condi- 
tioned media on osteoclast differentiation was examined by directly adding each 
fraction to RANKL-induced osteoclast differentiation in vitro. Fractions 33-35 
contained a high concentration of NaCl, which exerts an inhibitory effect on 
osteoclast differentiation. Proteins of the highly inhibitory fractions were dis- 
solved in SDS-PAGE sample buffer (Nacalai Tesque) and the sample was 
resolved by SDS-PAGE. Protein bands were visualized by Coomassie brilliant 
blue staining and all the protein bands were excised by scalpel. The samples were 
analysed using nano-LC-MS/MS by Japan Bioservice. The data were submitted to 
the MASCOT program for identification. 

Immunohistochemical staining. After fixation in 4% paraformaldehyde, bone 
tissues were decalcified in 10% EDTA at 4°C for 2 weeks and embedded in 
paraffin after dehydration. For immunohistochemical staining, antigen retrieval 
was carried out with 10 mM citric acid, pH 6.0, at room temperature for 2h. After 
quenching of endogenous peroxidase activity by incubation with 3% H2O2 in 
methanol, the sections were incubated with an anti-Sema3A polyclonal antibody 
(Santa Cruz Biotechnology) in immunoreaction enhancer solution (Can Get 
Signal immunostain, TOYOBO) at 4 °C overnight. After washing with PBS, 
the sections were incubated with peroxidase-conjugated secondary antibody 
according to the manufacturer’s instructions (Histofine, Nichirei Bioscience). 
The signals were visualized with 3,3'-diaminobenzidine tetrahydrochloride and 
H2O2. TRAP staining was conducted after the immunostaining. Haematoxylin 
was used for nuclear counterstaining. 

Western blot and immunoprecipitation analyses. Cell lysate or culture 
supernatant of calvarial cells was subjected to western blot analysis using the 
specific antibodies for Nrp1 (Calbiochem), β-actin (Sigma-Aldrich), Sema3A, 
Nrp1, p50, p65, histone H1 (Santa Cruz Biotechnology), phospho-ERK, ERK, 
phospho-JNK, JNK, phospho-p38, p38, phospho-IKKα/β, IKKα, IKKβ, 
phospho-PLCγ2, PLCγ2 (Cell Signaling Technology), β-catenin (Millipore) and α-tubulin 
(MBL). Nuclear proteins were prepared with a nuclear extract kit in accordance with 
the manufacturer’s protocol (Active Motif). For immunoprecipitation analysis, 
cells were solubilized in lysis buffer (1% Nonidet P-40 in 50 mM NaCl, 50 mM 
Tris-HCl, 5 mM EDTA, 1 mM NaF and 2 mM PMSF), supplemented with complete 
protease inhibitor cocktail (Roche Applied Science). Immunoprecipitation 
was performed by incubation with an anti-Flag M2 (Sigma-Aldrich) or 
anti-PlxnA1 antibody (Santa Cruz Biotechnology) followed by the addition of 
Dynabeads Protein G (Invitrogen). Immune complexes were separated by 
electrophoresis followed by blotting with anti-Flag M2, anti-V5 (Invitrogen), 
anti-Nrp1, anti-PlxnA1, anti-DAP12 (Santa Cruz Biotechnology) and 
anti-TREM2 antibodies (R&D Systems). 

Flow cytometric analysis. For the analysis of bone marrow-derived osteoclast 
precursor cells, a single cell suspension of mouse bone marrow cells was stained 
with anti-CD3ε (145-2C11, eBioscience), anti-B220 (RA3-6B2, eBioscience), 
anti-CD11b (M1/70, eBioscience), anti-CD115 (AFS98, eBioscience) and 
anti-CD117 (2B8, eBioscience) antibodies. For intracellular staining of BMMs, 
anti-CD11b, anti-CD115 and anti-Nrp1 (R&D Systems) were used. Flow cytometric 
analysis was performed using FACSCanto II with Diva software (BD Biosciences). 

ELISA. Concentrations of soluble RANKL and Opg in serum were determined 
using ELISA kits (R&D Systems), according to the manufacturer’s instructions. 

Retroviral gene transfer. The retroviral vector pMXs-Nrp1-IRES-EGFP was 
constructed by inserting DNA fragments encoding Nrp1 into pMXs-IRES-EGFP. 
The construction of the retroviral vectors pMX-FARP2-IRES-GFP and 
pMX-ΔGEF-FARP2-IRES-GFP was described previously. For the construction 
of the retroviral vectors pSIREN-RetroQ-ZsGreen-shNrp1 and 
pSIREN-RetroQ-ZsGreen-shControl, RNA targeting regions with a hairpin sequence 
(Nrp1 shRNA sense: 5'-GCCCGAATGTTCTCAGAACTACTCGAGTAGTTCTGAGAACATTCGGGCTTTTT-3'; 
Nrp1 shRNA antisense: 5'-AAAAAGCCCGAATGTTCTCAGAACTACTCGAGTAGTTCTGAGAACATTCGGGC-3'; 
control shRNA sense: 5'-GTGCGTTGCTAGTACCAACTTCAAGAGATTTTTTACGCGT-3'; 
control shRNA antisense: 5'-ACGCGTAAAAAATCTCTTGAAGTTGGTACTAGCAACGCAC-3') 
were inserted into RNAi-ready pSIREN-RetroQ-ZsGreen (Clontech). The retrovirus 
supernatants were obtained by transfecting the retroviral vectors into the Plat-E 
packaging cell line using FuGENE 6 (Roche Applied Science). 
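
The relationship between the sense and antisense oligonucleotides listed for the shRNA vectors can be checked mechanically. The following Python sketch is illustrative only and is not part of the original protocol: it tests whether each antisense oligonucleotide is the reverse complement of its sense partner and splits the Nrp1 hairpin into stem, loop and terminator. The CTCGAG loop and TTTTT terminator are read off the published sequence rather than stated in the text, and the function names are hypothetical.

```python
# Illustrative check of the shRNA oligonucleotides quoted in the text.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def split_hairpin(sense: str, loop: str, terminator: str = "TTTTT"):
    """Split a sense oligo into (stem, loop, antisense stem).

    Assumes the layout stem + loop + reverse-complement stem + terminator.
    This layout is inferred from the sequence, not stated in the source.
    """
    if not terminator or not sense.endswith(terminator):
        raise ValueError("terminator not found at the 3' end")
    body = sense[: -len(terminator)]
    stem, found_loop, anti_stem = body.partition(loop)
    if not found_loop:
        raise ValueError("loop not found")
    return stem, found_loop, anti_stem


# Sequences copied from the Methods text (5' to 3').
NRP1_SENSE = "GCCCGAATGTTCTCAGAACTACTCGAGTAGTTCTGAGAACATTCGGGCTTTTT"
NRP1_ANTISENSE = "AAAAAGCCCGAATGTTCTCAGAACTACTCGAGTAGTTCTGAGAACATTCGGGC"
CONTROL_SENSE = "GTGCGTTGCTAGTACCAACTTCAAGAGATTTTTTACGCGT"
CONTROL_ANTISENSE = "ACGCGTAAAAAATCTCTTGAAGTTGGTACTAGCAACGCAC"

if __name__ == "__main__":
    stem, loop, anti_stem = split_hairpin(NRP1_SENSE, loop="CTCGAG")
    print("stem:", stem)
    print("loop:", loop)
    print("stem is self-complementary:", anti_stem == reverse_complement(stem))
    print("Nrp1 oligos anneal:",
          reverse_complement(NRP1_SENSE) == NRP1_ANTISENSE)
    print("control oligos anneal:",
          reverse_complement(CONTROL_SENSE) == CONTROL_ANTISENSE)
```

A check of this kind is a cheap way to catch transcription errors in oligonucleotide sequences before ordering them.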

Chromatin immunoprecipitation assay. Chromatin immunoprecipitation 
assay was performed using the ChIP-IT express chromatin immunoprecipitation 
kit (Active Motif) according to the manufacturer’s instructions. The antibodies 
used for immunoprecipitation were anti-p50, anti-p65 and normal rabbit IgG 
(Santa Cruz Biotechnology). The primer sequences were as follows: Nrp1 region 
1, 5'-CATACGTGACCTTGCGCTCT-3' and 5'-CCTGGCTGGAGATTCAGAGA-3'; 
Nrp1 region 2, 5'-ACCTTACCCACCAGCTCCTT-3' and 5'-ATACGCCACCCACTTACGAG-3'; 
Nrp1 region 3, 5'-ATGTGGCTTGGTGAAAGGAG-3' and 5'-TGCTTCTACCTTCGGGTGAT-3'. 

Reporter gene assay. The reporter plasmid Nrp1-Luc was constructed by sub- 
cloning a 3,259 base pair fragment of the 5’ flanking region of the mouse Nrp1 
gene into the pGL3-basic vector (Promega). The reporter plasmids and the 
expression plasmids were transfected into NIH3T3 cells using FuGENE 6 
(Roche Applied Science). After 36 h, a dual luciferase assay was performed 
according to the manufacturer’s protocol (Promega). 

Migration assay. BMMs suspended in complete medium were added to the upper 
chamber of transwell units (Corning). Inserts were placed into the lower chambers 
of transwell units containing M-CSF with or without Sema3A-Fc. After 
incubation, cells were fixed with 4% paraformaldehyde and stained with 0.5% toluidine 
blue. The cells on the upper side of the membrane were removed and the cells that 
had migrated to the lower side of the membrane and chamber were counted. 

G-LISA small G protein activation assay. RhoA and Rac GTPase activation were 
determined using the G-LISA RhoA and Rac absorbance-based assay 
(Cytoskeleton) according to the manufacturer’s instructions. In brief, cell lysates 
were prepared and normalized. After the addition of antibodies against RhoA or 
Rac and the incubation with the horseradish peroxidase detection reagent, signals 
were detected with a spectrophotometer. 

In vivo treatment with recombinant Sema3A. Five-week-old C57BL/6 mice 
were given weekly intravenous injections of 1 mg per kg body weight of 
Sema3A-Fc or vehicle for four weeks. Three days after the last injection, bone 
analysis was performed as described earlier. 

Bone regeneration model. Skeletal injury was generated as described 
previously. In brief, C57BL/6 mice were anaesthetized with an intraperitoneal 
injection of pentobarbital sodium. A 5-mm longitudinal incision was made 
over the proximal femur and the bone surface was exposed by splitting the 
muscle. A 0.5-mm hole was made by drilling through the anterior portion of the 
diaphysis of the bilateral femurs. Four and seven days after surgery, femoral 
defects were treated with Sema3A-Fc (0.5 mg per kg body weight) by injection 
into the injury site. Mice were euthanized at day 14 after surgery and bone 
analyses were performed. 

Ovariectomy-induced bone loss. Nine-week-old female mice were ovariectomized 
or sham operated. More than five mice were examined in each group. 
Ovariectomized mice were given weekly intravenous injections of 1 mg per kg body 
weight of Sema3A-Fc or vehicle for four weeks. Three days after the last injection, all 
of the mice were euthanized and subjected to bone analysis as described earlier. 

Statistical analyses. Statistical analyses were performed using the unpaired 
two-tailed Student’s t test (*P < 0.05; **P < 0.01; ***P < 0.001; NS, not significant, 
throughout the paper). All data are expressed as the mean ± s.e.m. Results are 
representative examples of more than three independent experiments. 
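
The summary statistics named in the statistical analyses paragraph can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' analysis code: the measurements are invented for the example, the function names are hypothetical, and the sketch stops at the t statistic and degrees of freedom because converting these to a two-tailed P value requires a t distribution that the standard library does not provide. The asterisk thresholds follow those given in the text.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for one sample."""
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return statistics.mean(values), sem


def student_t(a, b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / df
    se_diff = math.sqrt(pooled_var * (1 / na + 1 / nb))
    return (statistics.mean(a) - statistics.mean(b)) / se_diff, df


def significance_label(p):
    """Map a two-tailed P value to the asterisk code used in the text."""
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return "NS"


if __name__ == "__main__":
    # Hypothetical measurements, for illustration only.
    vehicle = [4.1, 3.8, 4.4, 4.0]
    treated = [5.2, 5.6, 4.9, 5.4]
    for name, group in (("vehicle", vehicle), ("treated", treated)):
        m, s = mean_sem(group)
        print(f"{name}: {m:.2f} ± {s:.2f} (mean ± s.e.m.)")
    t, df = student_t(treated, vehicle)
    print(f"t = {t:.3f} with {df} degrees of freedom")
```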
The phytohormone auxin acts as a prominent signal, providing, by 
its local accumulation or depletion in selected cells, a spatial and 
temporal reference for changes in the developmental program. 
The distribution of auxin depends on both auxin metabolism 
(biosynthesis, conjugation and degradation) and cellular auxin 
transport. We identified in silico a novel putative auxin transport 
facilitator family, called PIN-LIKES (PILS). Here we illustrate 
that PILS proteins are required for auxin-dependent regulation of 
plant growth by determining the cellular sensitivity to auxin. PILS 
proteins regulate intracellular auxin accumulation at the endoplasmic 
reticulum and thus auxin availability for nuclear auxin signalling. 
PILS activity affects the level of endogenous auxin indole-3-acetic acid 
(IAA), presumably via intracellular accumulation and metabolism. 
Our findings reveal that the transport machinery to compartment- 
alize auxin within the cell is of an unexpected molecular complexity 
and demonstrate this compartmentalization to be functionally 
important for a number of developmental processes. 

Prominent auxin carriers with fundamental importance during 
plant development are PIN-FORMED (PIN) proteins. PIN1-type 
auxin carriers regulate the directional intercellular auxin transport 
at the plasma membrane. In contrast, atypical family member PIN5 
regulates intracellular auxin compartmentalization into the lumen of the 
endoplasmic reticulum and its role in auxin homeostasis was recently 
identified. PIN proteins have a predicted central hydrophilic loop, 
flanked at each side by five transmembrane domains. We screened 
in silico for novel PIN-like putative carrier proteins with a predicted 
topology similar to PIN proteins (Fig. 1a and Supplementary Fig. 2) 
and identified a protein family of seven members (Fig. 1b) in 
Arabidopsis thaliana, which we designated as the PILS proteins. In 
contrast to the similarities in the predicted protein topology, PIN 
and PILS proteins do not show pronounced protein sequence identity 
(10-18%), which limits the identification of PILS proteins by 
conventional, reciprocal basic local alignment search tool (BLAST) 
approaches. However, the distinct PIN and PILS protein families 
both contain the InterPro auxin carrier domain, which is an 
in silico-defined domain aiming to predict auxin transport function 
(http://www.ebi.ac.uk/panda/InterPro.html). The PILS putative carrier 
family is conserved throughout the whole plant lineage, including 
unicellular algae (such as Ostreococcus tauri and Chlamydomonas 
reinhardtii) (Supplementary Fig. 3) where PIN proteins are absent, 
indicating that PILS proteins are evolutionarily older. 
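
The 10-18% figure quoted for PIN and PILS proteins is a pairwise sequence identity. As a rough illustration only, the Python sketch below computes percent identity from two sequences that have already been aligned. The toy sequences are hypothetical, and counting only columns in which neither sequence has a gap is an assumption, since the text does not state which definition or alignment program was used.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, not real PIN or PILS sequences.
    print(percent_identity("ACDE-G", "ACNEFG"))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, so identity values are only comparable when the same definition is used throughout.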

PILS genes are broadly expressed in various tissues (Fig. 1c) and 
PILS2-PILS7 were transcriptionally upregulated by auxin application 
in wild-type seedlings (Fig. 1d-f and Supplementary Fig. 4), indicating 
a role in auxin-dependent processes. To investigate the potential func- 
tion of the putative PILS auxin flux facilitators in plant development, 
we overexpressed PILS proteins using the constitutive, viral 35S 
promoter. Ectopic expression of PILS genes, such as PILS1 or PILS3, 
resulted in dwarfed and/or bushy plants showing severe defects in flower 
development, leading to sterility in the T1 generation (Fig. 2a-d). 
Flowers of these PILS-overexpressing plants showed severe patterning 
defects, such as homeotic transformation of flower organs into new 
flower buds, triplication of the gynoecium or unfused carpels (Fig. 2b-d). 
To circumvent sterility, we screened for weaker p35S::PILS lines and 
isolated lines moderately overexpressing PILS5 that showed fertile flower 
development. 

To assess further the developmental importance and potential 
roles of PILS proteins in auxin-regulated processes, we focused 

Figure 1 | Novel PILS protein family. a, Predicted topology of both PIN1-/ 
PIN5-type PIN subfamilies and PILS proteins. b, Phylogenetic tree of the A. 
thaliana PILS proteins. Scale depicts 0.05 substitutions per position. c, Reverse 
transcription PCR (RT-PCR) of the seven PILS genes and EIF4A (control) in 
several plant tissues. d, e, 1-Naphthaleneacetic acid (NAA; 10 µM, 12 h)-induced 
pPILS5::GFP-NLS (GFP fused to nuclear localisation signal, NLS) 
expression in the root transition zone. Colour-code (black to white) depicts 
(low to high) GFP signal intensity. Propidium-iodide-stained cell walls in red. 
Scale bar, 25 µm. f, Quantitative RT-PCR of PILS2-7 after 10 µM NAA 
treatment for 1, 3 and 6 h. Colour code from blue (low) to red (high) depicts fold 
changes (0-30) (see also Supplementary Fig. 4). 

Figure 2 | Phenotypes of PILS loss- and gain-of-function mutants. 
a-d, Wild-type (WT) and T1 transgenics strongly expressing p35S::PILS, such as 
p35S::PILS3 (a), are sterile and show dwarfed growth. b-d, Flowers of WT 
(b), p35S::PILS1 (c) and p35S::PILS3 (d). e, Hypocotyl length of dark-grown 
pils2-2 pils5-2, p35S::PILS5::GFP and WT seedlings (n > 15). f, Dark-grown WT 
and p35S::PILS5 seedlings (5 days after germination). g, pPILS2::GFP-NLS (left) 
and pPILS5::GFP-NLS (right) expression in the root transition zone (green). 
Propidium-iodide-stained cell walls in red. Scale bar, 25 µm. h, i, Root length 
(h) and lateral root density (14 days after germination) (i) of pils2-2, pils5-2, 
pils2-2 pils5-2, p35S::PILS5::GFP and WT seedlings (n > 40). Error bars 
represent s.e.m. Student t-test P-values: *P < 0.05, **P < 0.001, ***P < 0.0001. 


on PILS2 and PILS5, because they are the most abundantly expressed 
PILS genes in seedlings (Fig. 1c) and display partially overlapping 
expression domains (Fig. 2g). Initially, we investigated PILS function 
in auxin-dependent hypocotyl and root growth. Dark-grown 
p35S::PILS5::GFP-expressing (coding for a PILS5-green fluorescent 
protein (GFP) fusion) and pils2 pils5 double-mutant seedlings 
showed reduced and enhanced hypocotyl growth, respectively 
(Fig. 2e and Supplementary Fig. 5). PILS5 gain-of-function also 
resulted in agravitropic hypocotyl growth (Fig. 2f). PILS2 and PILS5 
showed a particular overlapping expression in the root transition zone 
(Fig. 2g), suggesting a redundant role in regulating root growth. 
Indeed, pils2 single-mutant and more pronounced pils2 pils5 double- 
mutant seedlings showed significantly longer roots compared to wild- 
type seedlings, whereas seedlings overexpressing PILS5 had shorter 
roots (Fig. 2h). Collectively, our data indicate the requirement of 
defined PILS protein activity for auxin-dependent growth regulation. 
Besides root and shoot organ growth, auxin tightly controls de 
novo organ formation such as lateral root organogenesis. Intriguingly, 
pils2 and pils5 single-mutants and more pronounced pils2 pils5 
double-mutants showed higher lateral root density (Fig. 2i). On the 
contrary, PILS5 gain-of-function reduced lateral rooting (Fig. 2i). 
These findings indicate developmental importance of PILS proteins 
in auxin-regulated processes, such as de novo organ formation and 
growth regulation. 

Next we investigated whether auxin responses are affected in pils 
mutants using the auxin response reporter DR5 (ref. 17). pils2-2 
knockdown and pils5-2 knockout mutants did not show altered DR5 
activity in the main root tips (Supplementary Fig. 6a), but showed 
higher pDR5rev::GFP signal intensity in lateral roots (Fig. 3a). 
Moderately p35S::PILS5::GFP-expressing seedlings showed a visibly 
reduced auxin response maximum in the very root tip of main 
(Fig. 3b) and lateral roots (Supplementary Fig. 6b). To distinguish 
between direct and indirect effects, we investigated the effect of PILS 
proteins on nuclear auxin signalling at the cellular level. Therefore, we 
transiently co-expressed PILS2 or PILS5 together with the auxin 
response reporter pDR5rev::mRFPer (expressing the red fluorescent 
protein (RFP) in response to auxin signalling) in tobacco Bright 
Yellow 2 (BY-2) cells. PILS2 or PILS5 expression in BY-2 cells reduced 
auxin signalling as visualized by pDR5rev::mRFPer activity (Fig. 3c, d). 
These findings indicate that PILS proteins negatively affect nuclear 
auxin signalling, presumably by affecting cellular auxin homeostasis. 

Figure 3 | PILS proteins affect auxin-dependent cellular growth. a, b, DR5 
promoter activity in pils2-2 and pils5-2 (lateral roots) (a) and p35S::PILS5::GFP 
(main root) (b) compared to wild type (WT) (see also Supplementary Fig. 6). 
Colour-code (black to white) depicts (low to high) pDR5rev::GFP and 
pDR5rev::mRFP1er signal intensity. Scale bars, 25 µm. c, d, Mean grey value of 
pDR5rev::mRFP1er signal intensity in BY-2 cells expressing p35S::PILS2 (c) or 
p35S::PILS5 (d) compared to control cells (p35S::GFP::HDEL) (n > 60 cells). 
e, f, Root hair length (e, arrowheads) and quantification (f) of transgenic lines 
expressing pEXP7A::GFP::GUS (control) and pEXP7A::PILS5::GFP (independent 
lines B1 and B4) (n = 20 seedlings with 400 counted root hairs in total). Scale 
bars, 250 µm. g, Relative root length of NAA-treated pils2-2 pils5-2 and 
p35S::PILS5::GFP seedlings compared to WT (n > 20). h, Auxin-induced root 
hair elongation (n = 20 seedlings with 400 counted root hairs in total). Mean 
untreated root hair length was subtracted from the treated values. Error bars 
represent s.e.m. Student t-test P-values: *P < 0.05, **P < 0.001, ***P < 0.0001. 

To address whether cellular PILS action on auxin signalling affects 
cellular growth, we expressed PILS1, PILS3 and PILS5 under a root- 
hair-specific promoter. Deviations in free (active) indole-3-acetic acid 
(IAA) levels or in auxin signalling induce (high levels) or repress (low 
levels) root hair growth. As expected, the root-hair-specific 
expression of PILS1, PILS3 and PILS5 significantly reduced root hair length 
(Fig. 3e, f and Supplementary Fig. 7a-c), possibly due to PILS-dependent 
regulation of auxin homeostasis and signalling. 

Next we tested whether PILS proteins affect auxin-dependent 
cellular growth responses. We treated PILS gain- and loss-of-function 
mutants with auxin that inhibits primary root growth. pils2 pils5 loss- 
and PILS5 gain-of-function mutants showed hyper- and hyposensitive 
root growth, respectively (Fig. 3g). In agreement with these 
observations, the auxin-promoted root hair growth was also enhanced in pils2 
pils5 double-mutants and reduced in PILS5 overexpressers (Fig. 3h). 
This set of data indicates that PILS putative auxin facilitators modulate 
auxin-dependent growth responses during plant development. 

To unravel the mechanism by which PILS proteins regulate auxin- 
dependent plant development, we investigated the subcellular local- 
ization of PILS proteins. We introduced amino- or carboxy-terminal 
GREEN/RED FLUORESCENT PROTEIN (GFP/RFP) fusions with 
PILS proteins and transiently or stably expressed these fusion proteins 
in tobacco BY-2 cell culture, Arabidopsis seedlings and heterologously 
in yeast (Fig. 4a-c and Supplementary Fig. 8). PILS1, PILS2, PILS3, 
PILS5, PILS6 and PILS7 GFP/RFP fusion proteins localized to the 
endoplasmic reticulum in all analysed systems (Fig. 4a-c and 
Supplementary Figs 8 and 10c). In contrast, N- and C-terminal PILS4 
fusions did not show detectable fluorescence. The transgenic 
pPILS5::PILS5::GFP also showed endoplasmic reticulum localization 
(Fig. 4a) and complemented the pils2 pils5 double-mutant to the pils2 
single-mutant level (Supplementary Fig. 9), indicating PILS function at 
the endoplasmic reticulum. 

Next, we addressed whether putative PILS auxin carriers affect cel- 
lular auxin accumulation. We generated PILS2 oestradiol-inducible 
tobacco BY-2 cell cultures and performed ³H-IAA accumulation 
assays. PILS2 induction increased the accumulation of radioactivity 
in BY-2 cells (Fig. 4d). In accordance with the auxin accumulation 
assays in BY-2 cells, pils2 pils5 double-mutant protoplasts showed 
significantly higher auxin export (Fig. 4e and Supplementary Fig. 10a), 
indicating reduced auxin retention capacity in pils2 pils5 
loss-of-function mutants. These gain- and loss-of-function studies consistently 
illustrate that PILS2 and PILS5 function at the endoplasmic reticulum 
controls cellular accumulation of auxin. 

To additionally address PILS protein function in a non-plant system, 
we expressed PILS3, PILS5 and PILS7 (tagged to GFP or haemagglutinin 
(HA)) in Saccharomyces cerevisiae yeast cells. In accordance with the 
effect of PILS2 and PILS5 in plant cells, the expression of A. thaliana 
PILS3, PILS5 and PILS7 in yeast also increased retention of exogenously 
applied auxin (Fig. 4f and Supplementary Fig. 10b). To assess the spe- 
cificity to auxin we used the common organic control benzoic acid (BA). 
Notably, PILS3, PILS5 and PILS7 did not affect the cellular accumula- 
tion of BA (Fig. 4f and Supplementary Fig. 10b). These findings indicate 
that PILS proteins specifically regulate cellular auxin accumulation. 

PILS proteins increase cellular auxin accumulation, but decrease 
auxin signalling, which is reminiscent of the regulation of auxin 
metabolism shown for the PIN5 auxin transporter (ref. 16). To assess the 
potential action of PILS proteins on auxin metabolism, we analysed 
the effect of PILS proteins on the auxin metabolism using the inducible 
PILS2 BY-2 cell line. The high-performance liquid chromatography 
(HPLC) chromatogram of IAA showed a PILS2-dependent reduction 
of free IAA within 20 min (Fig. 4g, h), indicating that PILS2 activity 
affects auxin metabolism. 

In accordance with the auxin metabolite profiling in BY-2 cell cultures, 
pils2, pils5 and pils2 pils5 mutant Arabidopsis seedlings showed 
significantly higher free IAA levels compared to wild-type seedlings (Fig. 4i). 
Intriguingly, the ratio of amide auxin conjugates IAA-Glutamate (Glu) 

Figure 4 | PILS involvement in cellular auxin homeostasis. a-c, GFP-PILS2 
(b), PILS3-GFP (b), PILS5-GFP (a-c) under endogenous (a) or constitutive 
promoter (b, c) in A. thaliana root (a, c), tobacco BY-2 cells (b, left panels) and 
Saccharomyces cerevisiae yeast cells (b, right panel). c, Immunocytochemistry of 
PILS5-GFP and the endoplasmic reticulum marker BIP2. d, ³H-IAA retention 
in tobacco BY-2 cells upon oestradiol (Est.)-induced PILS2 expression (n = 3 
repetitions). e, ³H-NAA export assay in A. thaliana protoplasts of wild type 
(WT), pils2-2, pils5-2 and pils2 pils5 mutants (n = 3 repetitions; see also 
Supplementary Fig. 10a). f, Accumulation of auxin and benzoic acid in 
Saccharomyces cerevisiae yeast cells transformed with pGPD::PILS5::HA or an 
empty vector (n = 3 repetitions; see also Supplementary Fig. 10b). g, h, ³H-IAA 
metabolic profile (HPLC chromatogram) in PILS2 non-oestradiol-induced 
(g) and oestradiol-induced (h) tobacco BY-2 cells after 0 (black line) and 20 min 
(red line). i-k, Liquid chromatography-mass spectrometry (LC-MS)-derived 
free IAA levels (i) and ratios of IAA-Asp (j) and IAA-Glu (k) to free IAA in WT, 
pils2-2, pils5-2, pils2-2 pils5-2 and p35S::PILS5::GFP (n = 3 repetitions). Error 
bars represent s.e.m. Student t-test P-values: *P < 0.05, **P < 0.001, 
***P < 0.0001. Scale bar, 10 µm. 


and IAA-Aspartate (Asp) to free IAA was also significantly shifted 
towards free IAA in pils2 and pils5 loss-of-function mutants (Fig. 4j, k). 
Moderately p35S::PILS5::GFP-expressing seedlings showed a visibly 
reduced auxin response maximum in the very root tip (Fig. 3b), but 
only a mild and statistically non-significant reduction in free IAA levels 
at the whole seedling level (Fig. 4i). However, the ratio of IAA-Glu and 
IAA-Asp to free IAA was significantly shifted towards the conjugates 
in p35S::PILS5::GFP seedlings (Fig. 4j, k), indicating a higher rate of 
auxin conjugation. 

These findings are indicative of a PILS function in cellular auxin 
homeostasis by regulating auxin metabolism. Hence, we propose a 
model in which PILS proteins at the endoplasmic reticulum membrane 
facilitate intracellular auxin accumulation, which seems to contribute 
to the possibly compartmentalized regulation of auxin metabolism 
(Supplementary Fig. 1). It is tempting to speculate that auxin conjuga- 
tion could also take place in the endoplasmic reticulum, although the 
molecular components remain to be identified. 

Our in silico and reverse genetics approaches led to the identification 
of a novel family of putative auxin transport facilitators. All our genetic, 
pharmacological, cell biological, physiological and biochemical 
approaches consistently suggest that PILS proteins function at the 
endoplasmic reticulum membrane, regulate intracellular auxin 
accumulation and affect free IAA levels, presumably through 
conjugation-based auxin metabolism. The PILS action on cellular auxin homeostasis 
is reminiscent of the function of the atypical PIN family member PIN5 
(ref. 16). We uncovered that an additional, distinct protein family 
regulates intracellular auxin homeostasis. Our analyses of the PILS 
proteins suggest that intracellular auxin transport and, hence, auxin 
compartmentalization might be evolutionarily older than directional, 
cell-to-cell PIN-dependent auxin transport mechanisms. The iden- 
tification of a novel protein family for the regulation of intracellular 
auxin homeostasis highlights the evolutionary and developmental 
importance of intracellular auxin transport. Further studies will 
address the potential interplay or possible diversified function of 
endoplasmic reticulum-localized PIN5 and PILS1-7 proteins. 


METHODS SUMMARY 

Plant material and growth conditions. We used Arabidopsis thaliana of ecotype 
Columbia 0 (Col-0). The Nicotiana tabacum L. cv. Bright Yellow 2 (BY-2) cell 
line’? was used as suspension-cultured cells. pils2-1 (SALK_024808), pils2-2 
(SALK_125391), pils5-1 (SALK_070653) and pils5-2 (SALK_072996) were 
obtained from the Nottingham Arabidopsis Stock Centre (NASC). Gateway 
cloning was used to construct pPILS2::GFP/GUS, pPILS5::GFP/GUS, 
p35S::PILS1-7, p35S::GFP::PILS1-7, p35S::PILS1-7::GFP, pPILS5::PILS5::GFP 
and pMDC7_B(pUBQ)::PILS2. The following lines and constructs have been previously 
described: pDR5rev::mRFP1er®, pDR5rev::GFP’ and p35S::GFP::HDEL”'. 
Seeds were stratified at 4 °C for 2 days in the dark. Seedlings were grown vertically 
on half Murashige and Skoog medium. Plants were grown under long-day (16 h 
light/8 h dark) conditions at 20-22 °C. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


PILS gene accession codes. Sequence data from this article can be found in The 
Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org/) or 
GenBank/EMBL databases under the following accession numbers: PILS1 
(At1g20925), PILS2 (At1g71090), PILS3 (At1g76520), PILS4 (At1g76530), 
PILS5 (At2g17500), PILS6 (At5g01990), PILS7 (At5g65980). 

Plant material, growth conditions and DNA constructs. We used Arabidopsis 
thaliana of ecotype Columbia 0 (Col-0). The Nicotiana tabacum L. cv. Bright Yellow 
2 (BY-2) cell line’’ was used as suspension-cultured cells. pils2-1 (SALK_024808), 
pils2-2 (SALK_125391), pils5-1 (SALK_070653) and pils5-2 (SALK_072996) were 
obtained from the Nottingham Arabidopsis Stock Centre (NASC). Insertion sites 
were verified, homozygous lines selected and the decrease or absence of the 
respective PILS transcript was shown by RT-PCR. The pils2-2 and the pils5-2 
mutants were crossed into DR5rev::GFP’. Gateway cloning was used to construct 
pPILS2::GFP::GUS, pPILS5::GFP::GUS, p35S::PILS1-7, p35S::GFP::PILS1-7, 
p35S::PILS1-7::GFP, pPILS5::PILS5::GFP and pMDC7_B(pUBQ)::PILS2. The PILS 
full-length genomic fragments, complementary DNA and promoter regions were 
amplified by PCR from genomic DNA and cDNA, respectively. The PCR was 
performed using the high-fidelity DNA polymerase iProof (Bio-Rad). The 
primers used are given below. The full genomic and cDNA fragments were cloned 
into the pDONR221 (Invitrogen) vector and the promoter regions into pDONR- 
P4P1 using Invitrogen BP-clonase according to manufacturer's instructions. Coding 
sequences were transferred from the entry clones to gateway-compatible destination 
vectors (given below) using the Invitrogen LR clonase(+) according to manufac- 
turer’s instructions. The resulting constructs were transformed into Col-0 plants by 
floral dipping in Agrobacterium tumefaciens liquid cultures. Yeast vectors were trans- 
formed into budding yeast (Saccharomyces cerevisiae) via electroporation. The 
p35S::PILS5::GFP line was crossed into pDR5rev::mRFP1er°. The following lines 
and constructs have been previously described: pDR5rev::mRFP1er®, 
pDR5rev::GFP’ and p35S::GFP::HDEL"". Seeds were stratified at 4 °C for 2 days in 
the dark. Seedlings were grown vertically on half Murashige and Skoog medium. 
Plants were grown under long-day (16 h light/8 h dark) conditions at 20-22 °C. 

Chemicals. 1-Naphthaleneacetic acid (NAA) was supplied by Duchefa; 
2,4-dichlorophenoxy acetic acid, oestradiol and propidium iodide by Sigma-Aldrich; 
and ³H-indole-3-acetic acid (³H-IAA), ³H-naphthalene-1-acetic acid (³H-NAA) 
and ¹⁴C-benzoic acid (¹⁴C-BA) (specific radioactivity 20 Ci mmol⁻¹) by American 
Radiolabelled Chemicals. 

RNA extraction and quantitative real time PCR (qPCR). Whole RNA of 
seedlings was extracted using the RNeasy Mini Kit (Qiagen) in technical triplicates; 
the extracted RNA samples were treated with DNase (Ambion). qPCR analysis was 
performed using an iCycler (Bio-Rad) with the Platinum SYBR Green qPCR Super-UDG 
kit (Invitrogen) following the recommendations of the manufacturer. qPCR was 
carried out in 96-well optical reaction plates heated for 10 min to 95 °C to activate 
hot-start Taq DNA polymerase, followed by 40 cycles of denaturation for 60 s at 
95 °C and annealing-extension for 60 s at 58 °C. Target quantifications were performed 
with specific primer pairs (given later) designed using Beacon Designer 4.0 
(Premier Biosoft International). Expression levels were normalized to the expres- 
sion levels of translation initiation factor EIF4A. The primers used are given later. 
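
The normalization to EIF4A described above can be illustrated with a short calculation. The sketch below is not the authors' analysis: it assumes the common 2^(-ΔCt) approach with roughly 100% amplification efficiency for both primer pairs, which the Methods do not state, and the Ct values are hypothetical numbers chosen only to show the arithmetic.

```python
from statistics import mean


def relative_expression(ct_target, ct_reference):
    """Return target expression relative to a reference gene as 2^(-dCt).

    Assumption: both primer pairs amplify with ~100% efficiency, so each
    cycle of difference corresponds to a two-fold difference in template.
    """
    delta_ct = ct_target - ct_reference
    return 2.0 ** (-delta_ct)


# Hypothetical Ct values from technical triplicates (not data from the paper).
pils5_ct = [24.1, 24.0, 23.9]
eif4a_ct = [20.0, 20.1, 19.9]

rel = relative_expression(mean(pils5_ct), mean(eif4a_ct))
print(f"PILS5 expression relative to EIF4A: {rel:.4f}")
```

Averaging the technical triplicates before taking the difference keeps the reference and target on the same footing; a difference of four cycles corresponds to a 16-fold lower signal than the reference under the stated efficiency assumption.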

Phenotype analysis. For analysis of the root length and lateral root density, plates 
were scanned on a flat-bed scanner. Root hairs were imaged with a binocular 
microscope (Leica). For hypocotyl analysis, seeds on plates were exposed to light 
for 3 h at 18 °C, and cultivated in the dark at 20 °C. Seedlings were imaged in real 
time with an infrared camera (Canon) to define the exact moment of germination 
and analysed 1, 3 and 5 days after germination. Hypocotyl, root and root hair 
lengths were measured with the ImageJ (http://rsb.info.nih.gov/ij/) software. 
Lateral root density for each seedling was obtained by calculating the number of 
lateral roots per root length unit 14 days after germination. For analysis of 
hypocotyl length, a minimum of 15 hypocotyls per condition or mutant line 
were analysed in each experiment. For analysis of root length and lateral root 
density, a minimum of 40 plants per condition or mutant line were analysed in 
each experiment. Means and standard errors were calculated and the statistical 
significance was evaluated by Student's t-test using the GraphPad Prism5 
(http://www.graphpad.com) software. For the analysis of root hair growth, 20 
seedlings per transgenic line were imaged with a binocular microscope (Leica) and 20 root hairs 
(randomly chosen in the root hair zone) per seedling were measured with the 
ImageJ (http://rsb.info.nih.gov/ij/) software. The mean and standard error of 
the mean per transgenic line were calculated and the statistical significance was 
evaluated by Student's t-test. To obtain the auxin-dependent root hair elongation, 
the same number of root hairs per seedling, seedlings per mutant line and 
condition were analysed as described above. The mean untreated root hair 
length of the respective genotype was subtracted from the individual auxin-treated 
root hair length to obtain auxin-induced growth in millimetres. The mean and 
standard error of the mean of the respective genotype were calculated and the 
statistical significance was evaluated by Student's t-test using the GraphPad 
Prism5 (http://www.graphpad.com) software. All experiments were performed 
in at least three independent biological repetitions. 
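
The derived quantities in this section (lateral root density, auxin-induced root hair growth, and the mean with its standard error) reduce to a few lines of arithmetic. The following is a minimal sketch under stated assumptions: the function names and all measurements are hypothetical, and the standard error is computed from the sample standard deviation, which the text does not specify.

```python
import math
from statistics import mean, stdev


def sem(values):
    """Standard error of the mean, using the sample standard deviation."""
    return stdev(values) / math.sqrt(len(values))


def lateral_root_density(n_lateral_roots, root_length_mm):
    """Number of lateral roots per unit of primary root length."""
    return n_lateral_roots / root_length_mm


def auxin_induced_growth(treated_lengths_mm, untreated_lengths_mm):
    """Subtract the mean untreated root hair length of a genotype from
    each individual auxin-treated root hair length (same genotype)."""
    baseline = mean(untreated_lengths_mm)
    return [length - baseline for length in treated_lengths_mm]


# Hypothetical measurements for one genotype (not data from the paper).
untreated = [0.50, 0.70, 0.60]
treated = [0.90, 1.10, 1.00]

induced = auxin_induced_growth(treated, untreated)
print("auxin-induced growth (mm):", [round(x, 2) for x in induced])
print("mean:", round(mean(induced), 2), "SEM:", round(sem(induced), 3))
print("lateral root density:", lateral_root_density(12, 60.0), "roots per mm")
```

Subtracting a per-genotype baseline, rather than a global one, means that differences in basal root hair length between genotypes do not masquerade as differences in auxin sensitivity.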

BY-2 plant material. Cells of tobacco line BY-2 (Nicotiana tabacum L., cv. Bright 
Yellow 2)'° transformed with pMDC7_B(pUBQ)::PILS2 were cultured in liquid 
cultivation medium (3% (w/v) sucrose, 4.3 g l⁻¹ Murashige and Skoog salts, 
100 mg l⁻¹ inositol, 1 mg l⁻¹ thiamine, 0.2 mg l⁻¹ 2,4-dichlorophenoxy acetic 
acid, and 200 mg l⁻¹ KH2PO4 (pH 5.8)). BY-2 cell lines were cultivated in darkness 
at 26 °C on an orbital incubator (Sanyo Gallenkamp, Schoeller Instruments; 
150 r.p.m., 32-mm orbit) and subcultured weekly. Stock BY-2 calli were maintained 
on media solidified with 0.6% (w/v) agar and subcultured monthly. 

Transient transformation of BY-2 cells and monitoring of cellular auxin signalling 
in BY-2. Ten ml of three-day-old cells were harvested on filter paper by 
vacuum filtration and kept on plates with solid BY-2 medium. The cells were 
transformed via particle bombardment with a PDS 1000/He biolistic system 
(Bio-Rad) according to the manufacturer’s instructions 
(http://www.bio-rad.com/webroot/web/pdf/lsr/literature/Bulletin_9075.pdf). 2 µl of plasmid DNA 
(0.05 µg µl⁻¹ of the pDR5rev::mRFP1er construct and 0.1 µg µl⁻¹ of p35S::PILS2 
and p35S::PILS5) was added to 6.25 µl of 1.6-µm diameter gold particles (dissolved 
in 50% glycerol). The suspension was supplemented with 2.5 µl spermidine (0.1 M 
stock solution) and 6.25 µl CaCl2 (2.5 M stock solution). The particles were pelleted 
by centrifugation, washed twice with 70% and 100% ethanol and, subsequently, 
resuspended in 10 µl of 100% ethanol. Cells were bombarded under a pressure of 
1,100 pounds per square inch. The plates were sealed with Parafilm and kept in the 
dark for 18 h at 25 °C. For microscopic analysis, cells were gently transferred from 
the filter to a microscopy slide (in water) and subsequently covered with a cover 
slip. Samples were analysed via confocal microscopy. The pDR5rev::mRFP1er 
expression was evaluated by defining the mean grey value (MGV) of each imaged 
cell (middle sections). For each experiment, confocal settings were defined based on 
the pDR5rev::mRFP1er signal of the control cells and remained unchanged during 
the respective experiments. Transformants were identified on the basis of the 
fluorescence of both proteins and imaged with a ×40 objective. Every 
experiment/transformation was done in triplicate and for each condition a total 
of at least 60 transformed cells were imaged. For each experiment, the means and 
standard errors were calculated and the statistical significance (independence 
between the two populations) was obtained by Student's t-test using the 
GraphPad Prism5 (http://www.graphpad.com) software. 
Immunocytochemistry. Whole-mount immunological staining on 5-day-old 
seedlings was done in an Intavis robot according to the described protocol”. 
The antibodies used at the final dilutions were monoclonal mouse anti-BIP2 
(Hsc70) at 1:200 (Stressgen Bioreagents), monoclonal rabbit anti-GFP at 1:600 
(Invitrogen). The secondary anti-mouse (Invitrogen) and anti-rabbit (Sigma- 
Aldrich) antibodies conjugated with Cy3 and Alexa488 respectively were used 
at 1:600 dilution. 

Microscopy. Confocal microscopy was done with a Zeiss 710 microscope (Zeiss) 
or Leica SP5 (Leica). Fluorescence signals for GFP (excitation 488 nm, emission 
peak 509 nm), mRFP1 (excitation 561 nm, emission peak 607 nm) and propidium 
iodide (PI) staining (excitation 536 nm, emission peak 617 nm) were detected with 
a ×20, ×40 (water immersion) or ×63 (water immersion) objective. Sequential 
scanning was used for double labelling to avoid crosstalk between channels. 
Fluorescence signal intensity was analysed with ImageJ (http://rsb.info.nih.gov/ij/) 
software and data were statistically evaluated with Microsoft Excel 2007. 

Auxin transport assays in tobacco BY-2 cells, baker’s yeast and Arabidopsis 
thaliana protoplasts. Auxin accumulation with 2-day-old BY-2 cells was measured 
as previously described'**>. The ³H-IAA was added to give a final concentration 
of 2 nM. Accumulation results were expressed as pmol of the particular auxin 
accumulated per 10⁶ cells. The 0.5-ml aliquots of cell suspension were collected 
continuously and accumulation of label was terminated by rapid filtration under 
reduced pressure on 22-mm-diameter cellulose filters. The cell cakes and filters 
were transferred to scintillation vials, extracted in 0.5 ml of 96% ethanol for 
30 min, and afterwards 4 ml of scintillation solution (EcoLite Liquid Scintillation 
Fluid, MP Biomedicals) was added. Radioactivity was determined by liquid 
scintillation counter Packard Tri-Carb 2900TR (Packard-Canberra, Meridian). 
Yeast ³H-IAA loading was quantified with the unspecific ¹⁴C-benzoic acid as 
control assayed in parallel and performed as previously described**. Relative 
export is calculated from yeast-retained radioactivity as follows: ((radioactivity 
in the yeast at time t = 10 min) − (radioactivity in the yeast at time 
t = 0 min)) × (100%)/(radioactivity in the yeast at t = 0 min). Unspecific loading 
due to diffusion was eliminated by vector control subtraction. IAA export from 
Arabidopsis thaliana mesophyll protoplasts was analysed as described’’. 
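
The relative export formula given for the yeast assay can be written as a small function. This is only a sketch of the stated arithmetic: the function name and the counts are hypothetical, and the vector-control subtraction mentioned in the text is not modelled because the Methods do not describe how it was applied.

```python
def relative_export(retained_t0, retained_t10):
    """Percentage change in yeast-retained radioactivity between
    t = 0 min and t = 10 min, relative to the t = 0 min value.

    A negative value means that label was lost from the cells.
    """
    if retained_t0 <= 0:
        raise ValueError("radioactivity at t = 0 min must be positive")
    return (retained_t10 - retained_t0) * 100.0 / retained_t0


# Hypothetical scintillation counts (arbitrary units, not from the paper).
print(relative_export(retained_t0=1000.0, retained_t10=800.0))
```

Expressing the change relative to the t = 0 min loading makes strains with different absolute uptake comparable.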

HPLC metabolic profiling in tobacco BY-2 cells. Two-day-old BY-2 cells were 
prepared for the experiment by equilibration in uptake buffer as already described 
for accumulation assays'*. Experiments were done in uptake buffer and under 
standard cultivation conditions. Cells were incubated with 20 nM ³H-IAA 
for a period of 0 and 20 min. Cells and media (uptake buffer) were collected 
and frozen in liquid nitrogen (100 mg of fresh weight and 5 ml per sample). 
Extraction and purification of auxin metabolites in cells and media were performed 
as described*’°. The metabolites were separated on HPLC consisting of 
autosampler and 235C diode array detector (Perkin Elmer), column Luna C18 (2), 
150 × 4.6 mm, 3 µm (Phenomenex, Torrance, USA), mobile phase A: 40 mM 
CH3COONH4 (pH 4.0) and mobile phase B: CH3CN/CH3OH, 1/1 (v/v). Flow 
rate was 0.6 ml min⁻¹ with linear gradient 30-50% B for 10 min, 50-100% B for 
1 min, 100% B for 2 min, 10-30% B for 1 min. The column eluate was monitored on 
235C DAD followed by Ramona 2000 flow-through radioactivity detector (Raytest 
GmbH) after online mixing with three volumes (1.8 ml min⁻¹) of liquid scintillation 
cocktail (Flo-Scint II, Perkin Elmer). The radioactive metabolites were identified 
on the basis of comparison of their retention times with authentic standards. 
For the results presentation the total integrated area of chromatogram plots has 
been normalized based on the equalization of total accumulated radiolabel. 

In silico and phylogenetic analysis. PILS genes were identified via the SMART- 
protein tool from EMBL (http://smart.embl-heidelberg.de/)””**. Phylogenetic tree 
of AtPILS was constructed with the DNA-man software version 4.0. PILS topologies 
were defined by the online HMM-top tool (http://www.enzim.hu/hmmtop/)”’ and 
visualized by the TMRPres2D software 
(http://biophysics.biol.uoa.gr/TMRPres2D/download.jsp)*°. PILS orthologues were identified with the online tool Plaza 
(http://bioinformatics.psb.ugent.be/plaza/)*". 

Free IAA and conjugate measurements in Arabidopsis thaliana. For the quantification 
of free IAA and its amino acid conjugates, approximately 10 mg of plant 
material was taken into analysis. The samples were processed as previously 
described”? and quantified by UHPLC-MS/MS. 

Used primers and vectors. Genotyping primers: 
pils2-1 RP, CTGGAGAAACCTGACATCTCG; 
pils2-1 LP, GATTGAAGCCGGCTTAAATTC; 
pils2-2 RP, CTGGAGAAACCTGACATCTCG; 
pils2-2 LP, TACCATTGATCTGTCTTCGGG; 
pils5-1 RP, TTGAGACCCGTATCATTGGAG; 
pils5-1 LP, TGTCCTGATAAAACCTTTTCAGG; 
pils5-2 RP, TACTGCACCGAAAATGAAACC; 
pils5-2 LP, TTGTACTATTTGCACCGGCTC. 

Insert primer (LBb) (combine with RP): GCGTGGACCGCTTGCTGCAACT. 

RT-PCR primers used for insertion lines: 
PILS2 Fw, GCGATCATTATCGGATCAGT; 
PILS2 Rev, TTGCATACCTTGGACAGTAGTC; 
PILS5 Fw, TGTTGAAGCCCGTAATTCCATGAAC; 
PILS5 Rev, TTCATTGCGGACCCTTTAATCAGC. 

qPCR primers: 
PILS1 Fw, CGGTAACACAGCTCCACTTC; 
PILS1 Rev, GCAACAAGTAACGCACAACC; 
PILS2 Fw, GTGATGCTTGTACTTGGTGGTATG; 
PILS2 Rev, AACTTGAACATTGGATCTGCTGAG; 
PILS3 Fw, AGGCGACCATGCAAGTGTTG; 
PILS3 Rev, GTGGTACAGCTAGATGACAGTGAG; 
PILS4 Fw, TGTCATAACTAAGCCTCCTTCAC; 
PILS4 Rev, CTCGCAACTCTCAGAATCTCC; 
PILS5 Fw, CTTGGAATAGTCTGTGTTCGGTAC; 
PILS5 Rev, GCACTGAGCATTCGTCTTGAG; 
PILS6 Fw, GCCTACATCAGTGCTCTCAG; 
PILS6 Rev, GCACTGAGCATTCGTCTTGAG; 
PILS7 Fw, TCCTCCAGACCCTCTCTTTCG; 
PILS7 Rev, ACAAGAAGATGACCGAGCACTC; 
EiF4a Fw, CTGGAGGTTTTGAGGCTGGTAT; 
EiF4a Rev, CCAAGGGTGAAAGCAAGAAGA. 

Cloning primers: gDNA/cDNA: 
PILS1_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGAGGATGAGGCTTTTGGATC; 
PILS1_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)GGCTACGAGCCACATGAAGAATG; 
PILS2_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGTCAGGTTTCTCCAGTGGAA; 
PILS2_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)TTGCATACCTTGGACAGTAGTCTC; 
PILS3_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGGTGAAGCTTTTGGAGCTGTTC; 
PILS3_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)AGCTACAAGCCACATGAAGAATG; 
PILS4_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGAAGCTTTTGGAGTTGTTCA; 
PILS4_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)TGTCACAAGCCACATGAAGAATG; 
PILS5_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGGGATTCTGGTCGTTGTTGGA; 
PILS5_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)GACTAACAAGTGAAGGAAGATGG; 
PILS6_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGATTGCTCGGATCCTTGCCG; 
PILS6_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)GAAGAGTATGTTAATGTAGAGTAG; 
PILS7_Fw, GGGGACAAGTTTGTACAAAAAAGCAGGCTCGATGGGTTTCTTAGAGTTGTTGGA; 
PILS7_Rev, GGGGACCACTTTGTACAAGAAAGCTGGGTC(TCA)GGAGAGGATGGAGAGGAAGATGG. 

Promoter: 
PILS2_Fw, GGGGACAACTTTGTATAGAAAAGTTGCGAACTCCATTGTTAACAGTAATAGC; 
PILS2_Rev, GGGGACTGCTTTTTTGTACAAACTTGCCTCGATCTCACTATGTAAAGCTCG; 
PILS5_Fw, GGGGACAACTTTGTATAGAAAAGTTGCGCAATATACGTGACGTGGTCCACT; 
PILS5_Rev, GGGGACTGCTTTTTTGTACAAACTTGCCTTTTTATGTGGTTCTTTAGAC. 

Destination vectors: pPILS::GFP/GUS: pKGWFS7; p35S::PILS::GFP: pH7FWG2,0; 
p35S::GFP::PILS: pH7WGF2,0; p35S::PILS::RFP: pK7RWG2,0; p35S::RFP::PILS: 
pK7WGR2,0 and p35S::PILS_D: pH7WG2D,1 (ref. 33); and pPILS::PILS::GFP: 
pK7m34GW,0 (ref. 34). 

Oestradiol-inducible PILS: pMDC7_B(pUBQ)* (p35S promoter was exchanged 
by the pUBQ10 promoter). 
Effective targeted cancer therapeutic development depends upon 
distinguishing disease-associated ‘driver’ mutations, which have 
causative roles in malignancy pathogenesis, from ‘passenger’ muta- 
tions, which are dispensable for cancer initiation and maintenance. 
Translational studies of clinically active targeted therapeutics can 
definitively discriminate driver from passenger lesions and provide 
valuable insights into human cancer biology. Activating internal 
tandem duplication (ITD) mutations in FLT3 (FLT3-ITD) are 
detected in approximately 20% of acute myeloid leukaemia (AML) 
patients and are associated with a poor prognosis’. Abundant 
scientific” and clinical evidence’”, including the lack of convincing 
clinical activity of early FLT3 inhibitors*’, suggests that FLT3-ITD 
probably represents a passenger lesion. Here we report point muta- 
tions at three residues within the kinase domain of FLT3-ITD that 
confer substantial in vitro resistance to AC220 (quizartinib), an 
active investigational inhibitor of FLT3, KIT, PDGFRA, PDGFRB 
and RET®’; evolution of AC220-resistant substitutions at two of 
these amino acid positions was observed in eight of eight FLT3- 
ITD-positive AML patients with acquired resistance to AC220. 
Our findings demonstrate that FLT3-ITD can represent a driver 
lesion and valid therapeutic target in human AML. AC220-resistant 
FLT3 kinase domain mutants represent high-value targets for future 
FLT3 inhibitor development efforts. 

Perhaps the most compelling evidence so far that FLT3-ITD could 
represent a driver mutation in AML was the identification of a secondary 
FLT3 kinase domain mutation that conferred moderate resistance to the 
multikinase inhibitor PKC412 in a single FLT3-ITD+ patient who 
relapsed after an initial bone marrow response’. Although the broad-spectrum 
kinase inhibitor sorafenib has anecdotally achieved bone 
marrow remissions in FLT3-ITD+ AML patients’, whether its mechanism 
of action is mediated through inhibition of FLT3 or a distinct 
kinase is unclear. Indeed, two patients who relapsed after an initial 
response to sorafenib had no detectable FLT3 kinase domain muta- 
tions at the time of resistance’®. 

A recent interim analysis of 53 relapsed/refractory FLT3-ITD+ 
AML patients evaluable for efficacy in a multinational phase II trial 
of AC220 monotherapy documented a composite complete remission 
(<5% bone marrow blasts) rate of 45% (frequently associated with 
incomplete recovery of peripheral blood counts)’. We sought to use the 
clinical activity of AC220 as a tool to define FLT3-ITD as a driver or 
passenger mutation in human AML. Using a previously validated in 
vitro saturation mutagenesis assay'’, we identified AC220 resistance- 
conferring mutations at four residues in the kinase domain of FLT3- 
ITD (Fig. 1a). Mutations at three of these amino acid positions 
conferred high degrees of in vitro AC220 resistance as demonstrated 
in proliferation (Fig. 1b) and cell-based biochemical assays (Fig. 1c). 
These residues consist of the ‘gatekeeper’ residue (F691) and two 


residues within the activation loop (D835, Y842). For unclear reasons, 
the E608K substitution did not confer substantial AC220 resistance 
and was not further characterized. 

We next assessed the presence of drug-resistant kinase domain 
mutations in FLT3-ITD in eight paired pre-treatment and relapse 
samples obtained from FLT3-ITD+ AML patients who initially 
achieved morphological reduction of bone marrow blasts to ≤5% with 
AC220 monotherapy, but subsequently relapsed despite continued 
AC220 treatment. In every case, subcloning and sequencing” of indi- 
vidual FLT3-ITD alleles revealed mutations at the time of relapse 
(Table 1) that were not detected pre-treatment (Supplementary 
Table 1). Mutations were confined to two of the three critical residues 
identified in our in vitro screen. The activation loop mutation D835Y 
was detected in three cases, D835V in two, and the gatekeeper muta- 
tion F691L was identified in three. Additionally, one novel activation 
loop mutation, D835F, was identified in a single patient. This mutation 
confers substantial in vitro resistance to AC220 (Supplementary Fig. 1) 
and cross-resistance to sorafenib (data not shown), and was probably 
Figure 1 | Mutation screen of FLT3-ITD reveals secondary kinase domain 
mutations that cause varying degrees of resistance to AC220. a, Numbers of 
independent AC220-resistant Ba/F3 FLT3-ITD subpopulations with amino 
acid substitution at the indicated residue obtained from a saturation 
mutagenesis assay (n = 97 clones). b, Normalized cell viability of Ba/F3 
populations stably expressing FLT3-ITD mutant isoforms after 48 h in various 
concentrations of AC220 (error bars represent standard deviations of triplicates 
from the same experiment). c, Western blot analysis using anti-phospho-FLT3 
(pFLT3) or anti-FLT3 antibody performed on lysates from IL-3-independent 
Ba/F3 populations expressing the FLT3-ITD mutant isoforms indicated. Cells 
were exposed to AC220 at the indicated concentrations for 90 min. 


1Division of Hematology/Oncology, University of California, San Francisco, California 94143, USA. 2Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA. 
3Pacific Biosciences, Menlo Park, California 94025, USA. 4Department of Oncology, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, Maryland 21231, USA. 5Abramson Cancer 
Center of the University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA. 6Ambit Biosciences, San Diego, California 92121, USA. 7Helen Diller Family Comprehensive Cancer Center, University of 
California, San Francisco, California 94115, USA. †Present addresses: KINOMEscan Division of DiscoveRx Corporation, San Diego, California 92121, USA (J.P.H.); Blueprint Medicines Corporation, 
Cambridge, Massachusetts 02142, USA (P.P.Z.); Institute for Genomics and Multiscale Biology, Mount Sinai School of Medicine, New York, New York 10029, USA (E.E.S. and A.K.). 


00 MONTH 2012 | VOL 000 | NATURE | 1 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


Table 1 | Summary of FLT3 kinase domain mutations in patients relapsed on AC220 

Subject number | Sex | Age (years) | Prior therapy | Karyotype at enrolment | Karyotype at relapse | Blasts in relapse sample (%) | New mutation at relapse | ITD+ clones with mutation | Weeks on study 
1009-003 | F | 75 | 7+3 | 45~54,XX,+3,+6,+7,+8,+13,+14,+21,+22[cp15]/46,XX[5] | 52 XX, +3, 46,1748, +10,+12,+13[cp7]/46,XX[14] | 90 | D835F | 6/15 | 12 
1011-006 | M | 70 | 7+3, low-dose cytarabine | Normal | ND | 10 | D835Y | 4/15 | 8 
1011-007 | F | 56 | 7+3, HAM | Normal | 46,XX,del(11)(p?13p?15)[12]/46,XX[9] | 80 | F691L; D835V | 4/24; 5/24 | 11 
1005-004 | F | 60 | Cytarabine and mitoxantrone | Normal | Normal | 92 | F691L | 9/22 | 19 
1005-006 | M | 43 | 7+3, MEC, allogeneic stem cell transplant | 46,XY,t(1;15)(p22;q15) | ND | 59 | D835Y | 8/17 | 6 
1005-007 | F | 59 | 7+3, HDAC | Normal | ND | 39 | D835V | 9/21 | 23 
1005-009 | M | 68 | Cytarabine and mitoxantrone | Normal | ND | 58 | D835Y | 8/14 | 19 
1005-010 | M | 52 | 7+3, HDAC, mitoxantrone and etoposide | 46,XY,t(4;12)(q26;p11.2),t(8;14)(q13;q11.2) | ND | 22 | F691L | 6/18 | 20 

All patients achieved morphological bone marrow blasts of ≤5% at best response. 7+3, low-dose cytarabine for 7 days plus 3 days anthracycline; HAM, high-dose cytarabine plus mitoxantrone; HDAC, high-dose 
cytarabine; MEC, mitoxantrone, etoposide, cytarabine. ND, not done. 


not detected in our saturation mutagenesis screen because its creation 
requires a two-nucleotide substitution. One patient (1011-007) 
seemed to have evolved polyclonal resistance, with both F691L and 
D835V mutations detected on separate FLT3-ITD sequences. 
Collectively, these findings suggest that clinical response and relapse 
in each of these eight patients are mechanistically mediated through 
modulation of FLT3-ITD kinase activity. 

To assess more precisely for resistance-conferring mutations at 
relapse, we used a recently described single molecule real-time 
(SMRT; Pacific Biosciences) sequencing platform, which can provide 
sequencing reads of sufficient length to enable focused interrogation of 
FLT3-ITD alleles (Supplementary Fig. 2)'*. With this assay, hundreds 
of reads (range 19-930) spanning the ITD region and kinase domain 
with an average read length of greater than 1 kilobase (kb) 
(Supplementary Table 2) were reliably obtained from individual 
patient samples. Attention was focused on the amino acid codons 
identified in the in vitro screen for AC220 resistance-conferring 
mutations. SMRT sequencing confirmed the presence of resistance-conferring 
kinase domain mutations in FLT3-ITD at relapse in all eight 
patient samples (Table 2). Consistent with the results obtained by 
subcloning and sequencing, mutations at E608 and Y842 were not 
detected. The frequency of individual alternative codon substitutions 
within FLT3-ITD ranged from as low as 2.7% (D835F in patient 1005-006) 
to 50.6% (D835Y in patient 1005-009). The presence of polyclonal 
resistance was confirmed in patient 1011-007, and noted in an additional 
three cases: 1009-003, 1005-006 and 1005-007 (Table 2 and 
Supplementary Fig. 3). In general, mutations were detected on distinct 
molecules, although in the case of 1011-007, a subset of FLT3-ITD 
molecules with F691L also harboured a D835V mutation (5/21 obser- 
vations; 23.8% of FLT3-ITD(F691L) alleles; data not shown). Analysis 
of three normal control samples revealed base substitutions at these 
residues at a very low frequency (Table 2 and Supplementary Table 3). 
The evolution of polyclonal resistance due to secondary kinase domain 
mutations in FLT3-ITD in four of eight relapsed patients is supportive 
of a central dependence upon FLT3-ITD signalling in the leukaemic 
clone of a subset of AML patients, and indicative of profound selective 
pressure exerted upon this clone by AC220. Additionally, these find- 
ings reveal the genetic complexity of drug-resistant disease that may 
evolve in cancer patients on clinically active therapy. 

All mutations identified at relapse were detected at a frequency 
significantly higher than that observed in a normal control, and 
although relapse occurred relatively rapidly in some patients, muta- 
tions were not convincingly detectable before treatment. The aggregate 
frequency of all mutations at relapse in individual patients ranged from 
approximately 20-50% in all cases, which is consistent with leukaemic 
blasts homozygous for FLT3-ITD and containing one drug-resistant 
allele per cell, although the presence of a heterogeneous blast population 
with only a subset of drug-resistant FLT3-ITD+ cells expressing 
kinase domain mutations cannot be excluded. 
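
The aggregate frequencies quoted above can be reproduced by summing the relapse frequencies in Table 2 for each patient. The sketch below does this and, purely as an illustration, converts each aggregate into the fraction of blasts that would carry a resistant allele under the simple model named in the text (two FLT3-ITD alleles per cell, one of them mutated). The function names are hypothetical, the frequencies are transcribed from Table 2, and the conversion is an assumption-laden back-of-envelope estimate rather than an analysis performed in the paper.

```python
# Relapse alternative-codon frequencies (%) per patient, transcribed from Table 2.
RELAPSE_FREQ = {
    "1009-003": {"D835Y": 8.4, "D835V": 3.3, "D835F": 10.2},
    "1011-006": {"D835Y": 41.0},
    "1011-007": {"F691L": 6.2, "D835Y": 3.0, "D835V": 29.6},
    "1005-004": {"F691L": 29.6},
    "1005-006": {"D835Y": 39.5, "D835F": 2.7},
    "1005-007": {"D835Y": 4.0, "D835V": 47.4},
    "1005-009": {"D835Y": 50.6},
    "1005-010": {"F691L": 25.3},
}


def aggregate_frequency(freqs):
    """Sum of all mutant codon frequencies (%) in one relapse sample."""
    return round(sum(freqs.values()), 1)


def implied_resistant_blast_fraction(aggregate_pct, itd_alleles=2, mutant_alleles=1):
    """Percentage of blasts carrying a resistant allele, assuming every
    blast has `itd_alleles` FLT3-ITD alleles of which `mutant_alleles`
    are mutated in resistant cells (an illustrative model only)."""
    return aggregate_pct * itd_alleles / mutant_alleles


aggregates = {pid: aggregate_frequency(f) for pid, f in RELAPSE_FREQ.items()}
for pid, agg in aggregates.items():
    print(pid, agg, round(implied_resistant_blast_fraction(agg), 1))
```

Under this model an aggregate allele frequency near 50% corresponds to essentially all blasts carrying one resistant allele; estimates slightly above 100% would reflect sampling noise or a departure from the assumed allele counts.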

The five substitutions that conferred a high degree of resistance to 
AC220 in vitro were cross-resistant to sorafenib in cell-based growth 
and biochemical assays (Supplementary Fig. 4). The degree of relative 
resistance to sorafenib associated with these mutants was generally 
similar to that observed with AC220 (Supplementary Table 4). 


Table 2 | Third-generation sequencing identifies polyclonal FLT3 kinase domain mutations 

Columns: Subject number | Mutation | Native codon | Alternative codon | Pre-treatment: observed alternative codon frequency in ITD+ sequences (%) | Pre-treatment: total number of ITD+ sequences sampled | Relapse: observed alternative codon frequency in ITD+ sequences (%) | Relapse: total number of ITD+ sequences sampled | Normal control no. 1: observed alternative codon frequency (%) | Normal control no. 1: total number of sequences sampled 

1009-003 | D835Y | GAT | TAT | 0.21 | 482 | 8.4 | 332 | 0.00 | 768 
1009-003 | D835V | GAT | GTT | 0.00 | 482 | 3.3 | 332 | 0.13 | 768 
1009-003 | D835F | GAT | TTT | 0.00 | 482 | 10.2 | 332 | 0.00 | 768 
1011-006 | D835Y | GAT | TAT | 0.00 | 196 | 41.0 | 402 | 0.00 | 768 
1011-007 | F691L | TTT | TTG | 0.18 | 561 | 6.2 | 341 | 0.22 | 450 
1011-007 | D835Y | GAT | TAT | 0.00 | 930 | 3.0 | 436 | 0.00 | 768 
1011-007 | D835V | GAT | GTT | 0.43 | 930 | 29.6 | 436 | 0.13 | 768 
1005-004 | F691L | TTT | TTG | 0.00 | 496 | 29.6 | 513 | 0.22 | 450 
1005-006 | D835Y | GAT | TAT | 0.00 | 171 | 39.5 | 261 | 0.00 | 768 
1005-006 | D835F | GAT | TTT | 0.00 | 171 | 2.7 | 261 | 0.00 | 768 
1005-007 | D835Y | GAT | TAT | 0.00 | 57 | 4.0 | 378 | 0.00 | 768 
1005-007 | D835V | GAT | GTT | 0.00 | 57 | 47.4 | 378 | 0.13 | 768 
1005-009 | D835Y | GAT | TAT | 0.00 | 19 | 50.6 | 445 | 0.00 | 768 
1005-010 | F691L | TTT | TTG | 0.00 | 387 | 25.3 | 150 | 0.22 | 450 

All P values <1 x 10~° for alternative codon frequencies at relapse compared to a representative normal control sample (no. 1 refers to one of three normal control samples analysed). 
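
The native and alternative codons in Table 2 make it easy to check the statement that D835F, unlike the other substitutions, requires a two-nucleotide change. The sketch below simply counts mismatched positions between each codon pair; the function name is hypothetical and the codon pairs are those listed in the table (GAT to TAT, GTT or TTT at D835; TTT to TTG at F691).

```python
def nucleotide_changes(native, alternative):
    """Count the positions at which two codons differ."""
    if len(native) != 3 or len(alternative) != 3:
        raise ValueError("codons must be three nucleotides long")
    return sum(a != b for a, b in zip(native, alternative))


# Native and alternative codons as listed in Table 2.
SUBSTITUTIONS = {
    "D835Y": ("GAT", "TAT"),
    "D835V": ("GAT", "GTT"),
    "D835F": ("GAT", "TTT"),
    "F691L": ("TTT", "TTG"),
}

changes = {name: nucleotide_changes(*codons) for name, codons in SUBSTITUTIONS.items()}
for name, n in changes.items():
    print(f"{name}: {n} nucleotide change(s)")
```

A mutagenesis screen that mostly introduces single-base changes would be expected to sample the one-change substitutions far more often than D835F, which is consistent with the explanation offered in the text.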
To understand the structural effects of AC220 resistance-conferring 
mutations, we modelled the binding of AC220 to FLT3 (Fig. 2a). The 
crystal structure of the FLT3 kinase domain has been determined 
previously in an inactive conformation” that resembles the inactive 
conformations of ABL”*, KIT’® and insulin receptor tyrosine kinase”, 
with the activation loop folded back onto the ATP-binding cleft (loop- 
in conformation), thereby preventing substrate loading. The Asp-Phe- 
Gly (DFG) motif at the amino-terminal end of the activation loop 
adopts the DFG-out conformation, in which the Asp side chain, which 
normally coordinates a magnesium ion, is removed from the active 
site. The activation of FLT3 would require flipping of the DFG motif 
and reorganization of the activation loop, as observed in ABL"* and 
insulin receptor kinase’’. Previously published binding data suggest 
that AC220 is a type II kinase inhibitor that preferentially binds to the 
inactive, DFG-out kinase conformation’. Our molecular docking ana- 
lysis supports a model whereby AC220 interacts favourably with the 
DFG-out, inactive conformation. In the docked AC220-FLT3 model, 
the AC220 amide group is 2.6 Å from the carbonyl group of C694, 
consistent with the formation of a hydrogen bond. The phenol ring of 
AC220 forms a perpendicular aromatic—aromatic interaction” with 
F830 in the DFG motif (Fig. 2b). This interaction would not be possible 
in the DFG-in, active kinase conformation. The gatekeeper residue 
F691 forms a π–π stacking contact with the benzo-imidazol-thiazol 
moiety of AC220, which may further stabilize the complex. 
Substitutions at F691 with non-aromatic residues such as leucine 
may not compensate for the π–π stacking interaction. 

Residues D835 and Y842 stabilize the inactive conformation of the 
activation loop by forming hydrogen bonds with the main-chain amide 
group of S838 and the side chain of D811, respectively (Fig. 2c). Thus, 
replacement of either residue might destabilize this particular inactive 
conformation of the activation loop, which would then be expected to 


Figure 2 | Modelling of FLT3-AC220 interactions. a, Docking model of the 
AC220-bound FLT3 kinase domain: AC220 (blue); activation loop (orange) 
and DFG motif (green); amino acid residues that confer AC220 resistance when 
mutated (F691, D835 and Y842) and that interact with AC220 (F691, C694 and 
F830) in sticks. b, Surface and stick presentation of AC220 and AC220- 
interacting residues on FLT3: the carbonyl oxygen of C694 interacts with one of 
the AC220 amide groups; F691 forms a π-π stacking interaction with AC220; 
F830 interacts with AC220 through a perpendicular aromatic-aromatic 
interaction. c, Structure of the activation loop: residues D835, Y842 and 
interacting residues on FLT3 depicted in sticks. 

hinder the binding of AC220. In further support of this model, a 
binding study of AC220 and sorafenib, a crystallographically proven 
type II inhibitor, revealed that the binding affinity of both inhibitors 
to the FLT3 D835V mutant is substantially reduced compared to 
native FLT3, both in the presence and absence of the juxtamembrane 
domain containing the ITD (Supplementary Fig. 5 and Supplemen- 
tary Table 5). 

Other potential explanations for the mechanism of resistance con- 
ferred by AC220-resistant mutants include increased kinase activity 
and differential activation of downstream effectors. Western blot ana- 
lysis of cells expressing AC220-resistant FLT3-ITD-mutant isoforms 
revealed increased FLT3 autophosphorylation of D835 mutant iso- 
forms, but no discernable difference in phosphorylation of down- 
stream targets, including the direct FLT3-ITD target STAT5A/B”, 
and no difference in cellular proliferation (Supplementary Fig. 6). 
Overall, these data support a primarily structural mechanism for 
AC220 resistance mediated by mutations at F691, D835 and Y842, 
although further studies are necessary for definitive confirmation. 
We speculate that the ability to retain inhibitory activity against activa- 
tion loop substitutions at D835 and Y842 will require a type I FLT3 
kinase inhibitor capable of effectively binding to the active, DFG-in 
conformation of the kinase. 

Substitutions at gatekeeper residues such as FLT3-ITD(F691) are 
well-documented causes of resistance to kinase inhibitors!***. 
Analogues of the FLT3-ITD(D835V) activation loop mutation have 
proven problematic for a number of kinase inhibitors: KIT(D816V), 
an activating mutation that is commonly detected in systemic 
mastocytosis, confers a high degree of resistance to imatinib and other 
KIT inhibitors”. Our data, although derived from a small cohort of 
patients and in need of validation in larger studies, suggest that 
substitutions at F691 and D835 in FLT3-ITD will pose substantial 
barriers to disease control in AML patients treated with either 
AC220 or sorafenib, and therefore represent high-value targets for 
novel FLT3 inhibitor development efforts. 

Compelling data suggest that activating FLT3 mutations are acquired 
relatively late during leukaemogenesis in a pre-established clone’, and 
alone are insufficient to cause acute leukaemia in pre-clinical models’. 
Recent evidence suggests that the molecular heterogeneity of individual 
leukaemias can be substantial, and can occur in both branching and 
linear fashions early during leukaemogenesis, including at the 
leukaemia-initiating or ‘leukaemic stem’ cell level’*. Collectively, our 
data are consistent with acquisition of FLT3-ITD and drug-resistant 
FLT3 kinase domain mutations in a leukaemia-initiating cell popu- 
lation, although formal transplantation studies in mice are needed to 
address this question definitively. Our findings validate FLT3-ITD as a 
therapeutic target in human AML, and suggest that FLT3-ITD is 
capable of conferring a state of ‘oncogene addiction’, whereby cellular 
survival pathways associated with normal or precancerous cells can 
become hijacked, leading to a state of reliance upon key signalling 
molecules that can be exploited therapeutically. This work supports 
the exploration of therapeutic strategies targeting select activating 
mutations in other signalling molecules that are believed to be 
acquired relatively late in disease evolution, such as JAK2 (ref. 27) 
or RAS’, with agents capable of achieving clinically meaningful target 
inhibition. Further studies will be required to identify mechanisms of 
drug resistance that may circumvent reliance on activated FLT3 by 
activation of downstream or parallel pathways, as has been described 
with other kinase inhibitors**. To that end, translational studies using 
detailed molecular analyses of primary samples obtained from AML 
patients treated with clinically effective targeted therapeutics promise 
to further inform mechanisms of drug resistance, strategies for future 
drug development, and models of disease evolution. 


METHODS SUMMARY 


MSCVpuroFLT3-ITD plasmid DNA was mutagenized and used to generate 
AC220-resistant Ba/F3 clones as previously described''. The FLT3 kinase domain 




was sequenced from PCR-amplified genomic DNA isolated from AC220- 
resistant clones. Identified drug-resistant mutations were re-engineered into 
MSCVpuroFLT3-ITD using site-directed mutagenesis and Ba/F3 cell lines were 
created as detailed in Methods. Cell viability in the presence and absence of drug 
was assessed using trypan blue exclusion. FLT3 phosphorylation status was deter- 
mined by western blot analysis of whole cell lysates prepared after 90 min of drug 
exposure from Ba/F3 cells stably expressing FLT3 mutant isoforms and from 
transfected 293T cells in the absence of drug. The FLT3 kinase domain was 
PCR amplified from cDNA derived from blood or bone marrow samples from 
patients enrolled on the exploratory portion of the phase II trial of AC220 in AML 
(http://clinicaltrials.gov/ct2/show/NCT00989261; identifier NCT00989261) and 
from normal control bone marrow or mobilized peripheral blood stem cells. 
PCR products were cloned into Escherichia coli and individual clones were 
sequenced using Sanger sequencing. Alternatively, SMRTBell libraries'* were pre- 
pared as per the manufacturer’s instructions and sequenced on a Pacific 
Biosciences RS instrument. For details of the computational sequencing analysis, 
please see Methods. Molecular docking of FLT3-ITD to AC220 was performed 
using the AutoDock 4.2 package. Inhibitor binding constants were measured using an 
active-site-dependent competition binding assay as previously described’®. For 
further details of methods, please see Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

DNA constructs, mutagenesis and resistance screen. FLT3-ITD cDNA cloned 
from the MV4;11 cell line (ITD: residues 591-601) into the HpaI site of the 
pMSCVpuro retroviral vector (Clontech) was a gift from Ambit Biosciences and 
was used as a template for mutagenesis. We used a modified strategy for random 
mutagenesis previously described''. Briefly, 1 µg of MSCVpuroFLT3-ITD was 
used to transform the DNA-repair-deficient Escherichia coli strain XL-1 Red 
(Stratagene) and plated on 20 ampicillin-agar bacterial plates. After incubation 
for 36h, colonies were collected by scraping, and plasmid DNA was purified by 
using a plasmid MAXI kit (Qiagen). Subsequently, mutagenized FLT3-ITD 
plasmid stock and Ecopack packaging plasmid were cotransfected into 293T cells 
grown in DMEM (Invitrogen) containing 10% fetal calf serum (FCS; Omega 
Scientific) using Lipofectamine 2000 (Invitrogen) as per the manufacturer’s 
protocol. Viral supernatants were collected at 48 h, purified using a 0.44 µm vacuum 
filter, and used to infect Ba/F3 cells at a 1:100 to 1:300 dilution of viral supernatant 
to fresh RPMI 1640 (Invitrogen) supplemented with 10% FCS. Alternatively, viral 
supernatant was aliquoted and frozen. Thawed supernatant was used to infect Ba/ 
F3 cells at a 1:50 dilution. Viral supernatant was diluted with the goal of minimiz- 
ing multiplicity of infection. For infection, 1-2 X 10° Ba/F3 cells were resuspended 
in 3 ml of the diluted viral stock supplemented with recombinant mouse IL-3 
(Invitrogen), and 4 µg ml⁻¹ polybrene, plated in each well of a 12-well tissue 
culture dish and centrifuged at 1,500g in a Beckman Coulter Allegra 6KR cent- 
rifuge with a microplate carrier for 90 min at 34°C. Centrifuged cells were subse- 
quently transferred to a 37°C incubator overnight. Infected Ba/F3 cells were 
washed twice with media to remove IL-3 and plated in 3 ml of RPMI medium 
1640 at 5 X 10° cells per well of a 6-well dish supplemented with 20% FCS and 
1.2% Bacto-agar with 20 nM AC220 (a gift from Ambit Biosciences). After 10-21 
days, visible colonies were plucked from agar and expanded in the presence of drug 
(20nM AC220). 

Sequencing and alignments. Expanded colonies were harvested 7-14 days after 
isolation from agar, and whole genomic DNA was isolated using the QIAamp kit 
(Qiagen). The FLT3 kinase domain was amplified by PCR from whole genomic 
DNA by using TopTaq DNA polymerase (Qiagen). The primers TK1F 
(5′-TGCTGTGCATACAATTCCCTTGGC-3′) and TK2R 
(5′-TCTCTGCTGAAAGGTCGCCTGTTT-3′) were used for kinase domain amplification, and subsequent 
bidirectional sequencing was performed using these primers in addition to 
TK1R (5′-AGITCCTCCTCTTCTTCCAGCCTTT-3′) and TK2F 
(5′-GAGAGGCACTCATGTCAGAACTCA-3′). Alignments to the native FLT3-ITD 
sequence were performed using Sequencher software (Gene Codes Corporation). 
Generation of mutants. Mutants isolated in the screen were engineered into 
pMSCVpuroFLT3-ITD by using the QuikChange mutagenesis kit (Stratagene). 
In all cases, individual point mutants were confirmed by sequence analysis. 

Cell viability and proliferation assays. Stable Ba/F3 lines were generated by using 
retroviral spinfection with the appropriate mutated plasmid as outlined above, 
with the exception of the exclusion of polybrene. At 48 h post-infection, puromycin 
was added to infected cells at a concentration of 4 µg ml⁻¹. Cells were selected in the 
presence of puromycin for 7-10 days and subsequently IL-3 was washed twice from 
the cells with media and cells were selected in RPMI medium 1640 plus 10% FCS in 
the absence of IL-3. Exponentially growing Ba/F3 cells (5 X 10*) were plated in each 
well of a 24-well dish with 1ml of RPMI 1640 plus 10% FCS containing the 
appropriate concentration of drug as indicated in triplicate. Cells were allowed to 
expand for 2 days and were counted by using a Vi-cell XR automated cell viability 
analyser (Beckman Coulter). The mean number of viable cells at varying concen- 
trations of drug was normalized to the median number of viable cells in the no-drug 
sample for each mutant. Error bars represent the standard deviation. Numerical 
IC50 values were generated using nonlinear best-fit regression analysis using Prism 
5 software (GraphPad). 
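
The normalization step described above can be sketched as follows. This is only an illustration under stated assumptions: the function name, the dictionary layout and all counts are hypothetical, and the curve fitting performed in Prism is not reproduced here.

```python
from statistics import mean, median, stdev


def normalize_viability(counts_by_conc, no_drug_counts):
    """Normalize mean viable-cell counts to the median no-drug count.

    counts_by_conc maps a drug concentration to its replicate counts.
    Returns {concentration: (normalized mean, normalized standard deviation)}.
    """
    baseline = median(no_drug_counts)
    result = {}
    for conc, replicates in counts_by_conc.items():
        result[conc] = (mean(replicates) / baseline, stdev(replicates) / baseline)
    return result


# Hypothetical triplicate counts for one mutant (not data from the study).
no_drug = [100_000, 110_000, 90_000]
treated = {1.0: [80_000, 85_000, 75_000], 10.0: [20_000, 25_000, 15_000]}
normalized = normalize_viability(treated, no_drug)
```

The normalized means could then be passed to any nonlinear regression routine to estimate an IC50.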

For proliferation assays, on day 0, parental Ba/F3 cells and Ba/F3 cells stably 
expressing FLT3-ITD mutant isoforms were plated in triplicate with 1 mL of 
RPMI 1640 plus 10% FCS at a density of 5 X 10° cells per well in each well of a 
24-well dish. Cells were allowed to expand and were counted by using a Vi-cell XR 
automated cell viability analyser (Beckman Coulter) on days 2, 3, 4, 5, 6 and 7. To 
maintain exponential growth of cells, 0.5 ml of cells from each well were used for 
counting on each day and 0.25 ml of cells from the remaining volume were transferred 
to a new well with 0.75 ml of fresh RPMI plus 10% FCS (including 2 ng ml⁻¹ 
of IL-3 for parental Ba/F3 cells). Extrapolated cell counts were calculated from the 
measured count on each day using the appropriate dilution factor (1× on day 2, 
4× on day 3, 16× on day 4, and so on). The number of viable cells at each time 
point was normalized to the starting number of cells for each cell line on day 0 and 
the mean normalized cell count on each day was calculated. Error bars represent 
the standard deviation. 
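
The extrapolation can be illustrated with a short sketch. It assumes that the daily 1:4 passage (0.25 ml of cells into 0.75 ml of fresh medium) is the only dilution and that counting starts on day 2; the function names and example counts are hypothetical.

```python
def extrapolated_count(measured, day, first_count_day=2, fold=4):
    """Scale a measured count by the cumulative dilution (1x, 4x, 16x, ...)."""
    if day < first_count_day:
        raise ValueError("counts are only taken from the first counting day onwards")
    return measured * fold ** (day - first_count_day)


def normalized_growth(measured_by_day, start_count):
    """Return extrapolated counts relative to the number of cells plated on day 0."""
    return {
        day: extrapolated_count(count, day) / start_count
        for day, count in measured_by_day.items()
    }


# Hypothetical measurements for a single well.
growth = normalized_growth({2: 20_000, 3: 10_000}, start_count=5_000)
```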

Immunoblotting. Exponentially growing Ba/F3 cells stably expressing each 
mutation along with a native FLT3-ITD control were plated in RPMI medium 
1640 plus 10% FCS supplemented with kinase inhibitor at the indicated 
concentration. After a 90-min incubation, the cells were washed in phosphate buffered 
saline (PBS) and lysed in Cell Extraction Buffer (Invitrogen) supplemented with 
protease and phosphatase inhibitors. The lysate was clarified by centrifugation and 
quantified by BCA assay (Thermo Scientific). Protein was subjected to sodium 
dodecylsulphate polyacrylamide electrophoresis and transferred to nitrocellulose 
membranes. Immunoblotting was performed using anti-phospho-FLT3, anti- 
phospho-STAT5, anti-STAT5, anti-phospho-ERK, anti-ERK, anti-phospho-S6, 
anti-S6, anti-GAPDH (Cell Signaling) and anti-FLT3 S18 antibody (Santa Cruz 
Biotechnology). Alternatively, 293T cells were plated in 6-cm plates and trans- 
fected with MSCVpuroFLT3-ITD plasmid containing FLT3 mutations of interest 
using Lipofectamine 2000 (Invitrogen) as per the manufacturer’s protocol. After 
48h, cells were washed with PBS, collected, lysed and subjected to western blot 
analysis as described above. 

Competition binding assays. Inhibitor binding constants were measured by 
using active site-dependent competition binding assays essentially as previously 
described”. In brief, FLT3 protein isoforms were labelled with a chimaeric 
double-stranded DNA tag containing the NF-κB binding site (5′-GGGAATTCCC-3′) 
fused to an amplicon for qPCR readout, which was added directly to the expression 
extracts. Binding reactions were assembled by combining DNA-tagged kinase 
extract, affinity beads loaded with a kinase inhibitor probe molecule, and test 
compound in 1× binding buffer (PBS, 0.05% Tween 20, 10 mM DTT, 0.1% BSA, 
2 mg ml⁻¹ sonicated salmon sperm DNA). Extracts were used directly in binding 
assays without any enzyme purification steps at a ≥10,000-fold overall stock 
dilution (final DNA-tagged enzyme concentration <0.1nM). Assays were incu- 
bated for 1h at room temperature (23 °C), which was sufficient to establish equi- 
librium. Subsequent washing, elution, and qPCR readout steps were as described”. 
Patients and FLT3 kinase domain sequencing analysis. Eight cases of acquired 
resistance to AC220 were analysed. Patients were enrolled on the exploratory 
cohort of the phase II clinical trial of AC220 in relapsed or refractory AML at 
the University of California, San Francisco (UCSF), University of Pennsylvania or 
Johns Hopkins University. Details of the clinical trials and results are reported 
elsewhere’. All patients were FLT3-ITD-positive at enrolment. Samples were col- 
lected pre-treatment and at the time of disease progression. Only patients who had 
achieved morphological clearance of bone marrow blasts to ≤5% at best response 
and subsequently relapsed with an increase in peripheral blood or bone marrow 
blasts are included in this analysis. The patients in this analysis included all the 
patients meeting the above criteria at the three participating institutions. All 
patients gave informed consent according to the Declaration of Helsinki to par- 
ticipate both in the clinical trials and for collection of samples. All research invol- 
ving human subjects was approved by the relevant Institutional Review Board at 
each individual participating institution (UCSF, University of Pennsylvania or 
Johns Hopkins). 

For sequencing, frozen Ficoll-purified mononuclear cells obtained from blood 
or bone marrow were lysed in Trizol (Invitrogen) and RNA was isolated according 
to the manufacturer’s protocol. cDNA was synthesized using Superscript II 
(Invitrogen) as per the manufacturer’s protocol. The FLT3 kinase domain and 
adjacent juxtamembrane domain were PCR amplified from cDNA using primers 
TK1F and TK2R as above. PCR products were cloned using TOPO TA cloning 
(Invitrogen) and transformed into competent E. coli. Individual colonies were 
plucked, expanded in liquid culture overnight and plasmid DNA for sequencing 
was isolated using the QIAprep Spin Miniprep kit (Qiagen). Each colony was 
considered representative of a single mRNA. To minimize contamination from 
PCR artefact, we sequenced at least 10 and up to 24 FLT3-ITD-containing clones 
from each sample and required that mutations be found in >15% of clones. The 
primers TK1F, TK1R, TK2F and TK2R were used for bidirectional sequencing as 
above. Alignments with native FLT3 sequence were performed using Sequencher 
software (Gene Codes Corporation). 
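
The clone-level criteria above (at least 10 ITD-containing clones per sample, and a mutation present in more than 15% of clones) can be sketched as a simple filter. The function name and the input layout are assumptions made for illustration, and the example calls are invented rather than taken from any patient sample.

```python
from collections import Counter


def recurrent_mutations(clone_calls, min_clones=10, min_fraction=0.15):
    """Return {mutation: fraction of clones} for mutations above the cut-off.

    clone_calls holds one set of mutation labels per sequenced
    FLT3-ITD-containing clone (an empty set means no mutation was called).
    """
    n_clones = len(clone_calls)
    if n_clones < min_clones:
        raise ValueError("too few ITD-containing clones were sequenced")
    counts = Counter(m for calls in clone_calls for m in set(calls))
    return {
        mutation: count / n_clones
        for mutation, count in counts.items()
        if count / n_clones > min_fraction
    }


# Hypothetical sample: 12 clones, 5 with D835V, 1 with F691L, 6 unmutated.
calls = [{"D835V"}] * 5 + [{"F691L"}] + [set()] * 6
kept = recurrent_mutations(calls)
```

Here the single F691L clone (1 of 12, about 8%) falls below the cut-off and would be treated as a possible PCR artefact.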

Sample preparation and SMRT sequencing. PCR product containing the FLT3 
kinase domain was generated from patient cDNA as described above using high 
fidelity DNA polymerase. We prepared PCR products for Pacific Biosciences 
sequencing using standard commercial kits and reagents 
(http://www.pacificbiosciences.com/products/consumables/reagents) following the manufacturer’s instructions. 
PCR product input amounts ranged from 0.3 to 3 µg, and we prepared SMRTBell 
libraries’ on the full PCR products without any fragmentation. We sequenced all 
samples on a Pacific Biosciences RS instrument and recorded sequence for 75 min. 
Computational analysis of FLT3 mutations. We obtained samples from three 
healthy individuals with no cancer history (two bone marrow, one mobilized 
peripheral blood stem cells), isolated RNA, made cDNA, amplified the FLT3 
kinase domain, and sequenced following a protocol identical to that used on the 
AML samples. We used the sequence from Normal Control no. 1 as a control for 
all process steps between sample acquisition and sequencing. Data from the 
remaining two normal controls were compared to the Normal Control no. 1 
and revealed no significant differences (Supplementary Table 3). We used the 
circular consensus sequencing (CCS) mode to obtain high accuracy reads for 
the ~1.4kb amplicon with PacBio RS. The CCS mode generates reads by 
combining multiple independent single-pass sequencing reads for individual 
molecules to correct raw errors and generate a better accuracy consensus (see 
Supplementary Fig. 2). We report only the CCS reads where the same molecule 
was sequenced three or more times, that is, raw read length >4.2 kb for the 1.4 kb 
amplicon. With the CCS reads, we obtained the sequence of the ~ 1.4 kb amplicon 
containing the FLT3 juxtamembrane and kinase domains with up to about 98% 
to 99% accuracy (see alignment identity for ITD samples in Supplementary Table 
2). For each CCS read, we used tandem repeats finder (TRF)” to identify the ITD 
sequence. To determine unambiguously and consistently whether a read was ITD⁻ or ITD⁺, 
we used only the CCS reads that included at least the region from the 
50-bp 5’-end upstream to the 50-bp 3’-end downstream sequence of the ITD 
region in the analysis. This allowed us to determine the number of sequences 
containing the ITD more accurately despite a small percentage of insertion and 
deletion errors in the CCS reads. Two distinct peaks allowed us to identify ITD⁻ 
versus ITD⁺ CCS reads unambiguously. We found that each sample had only one 
major ITD as expected, although in some cases the majority ITD differed at relapse 
compared to pre-treatment. We then passed the ITD⁺ population of the CCS 
reads to the next stage for codon mutation analysis. The number of total 
CCS reads identified is listed in Supplementary Table 2. We identified ~200-1,300 
CCS reads spanning the whole region between the ITD region and the furthest 
codon of interest (Y842) for codon analysis per sample. 
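
The two read-level filters described above (at least three passes over the ~1.4 kb amplicon, and coverage of the ITD region plus 50 bp on each side) might be expressed as in the sketch below. The `CcsRead` record, its field names and the ITD coordinates are hypothetical; this is not the authors' pipeline.

```python
from dataclasses import dataclass

AMPLICON_BP = 1400   # approximate amplicon length given in the text
MIN_PASSES = 3       # the same molecule sequenced three or more times
FLANK_BP = 50        # required flank on each side of the ITD region


@dataclass
class CcsRead:
    raw_length_bp: int  # total raw bases contributing to the consensus
    aln_start: int      # first amplicon position covered (0-based)
    aln_end: int        # one past the last amplicon position covered


def passes_filters(read, itd_start, itd_end):
    """True if the read has enough passes and spans the ITD with both flanks."""
    enough_passes = read.raw_length_bp > MIN_PASSES * AMPLICON_BP
    spans_itd = (
        read.aln_start <= itd_start - FLANK_BP
        and read.aln_end >= itd_end + FLANK_BP
    )
    return enough_passes and spans_itd


# Hypothetical reads and ITD coordinates within the amplicon.
good = passes_filters(CcsRead(5000, 0, 1400), itd_start=300, itd_end=360)
short = passes_filters(CcsRead(4000, 0, 1400), itd_start=300, itd_end=360)
clipped = passes_filters(CcsRead(5000, 280, 1400), itd_start=300, itd_end=360)
```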

For codon mutation analysis, we restricted our analysis to the 608, 691, 835 and 
842 codons from reference sequence NM_004119 (Homo sapiens FLT3 mRNA) 
and then took the frequency of sequences obtained for each of these codons in the 
PCR amplicon of healthy Normal Control no. 1 and compared that to the frequency 
of sequences in each AML patient sample. A local quality filter that required exact 
matching of the codons before and after the codon of interest was used for filtering 
out low quality codon calls that might be due to sequencing errors. We used the 
observed frequencies from the control sample for calculating the significance of the 
observed mutation in the AML patient samples. The P value was calculated by 
comparing the numbers of native codons observed and the alternative codon 
between the control sample and the AML patient sample with Fisher’s exact test 
on the contingency table*”*’. Owing to the potential statistical bias that could arise if 
the number of observed mutations was small in some cases, or if sequencing error 
frequencies differed between mutant and reference codon sequences, we only report 
the mutations using a conservative significance threshold of P< 1 X 10°. We used 
a simulation to determine the sensitivity of this analysis to detect a true mutation at a 
given codon position. Sensitivity in this analysis was determined as a function of 
three parameters: the number of errors observed in the control sequence, the total 
number of times that position is sequenced in the control, and the number of times 
that position is sequenced in the patient sample. For the simulation, we conserva- 
tively assumed that all alternative codons seen in the control are actual errors. With 
this simulation, we estimated that this analysis allows us to detect variants in the 
patient sample above 3% with high confidence if we get more than 300 observations 


of the codon of interest. To refine further our search for mutations underlying 
relapse in these patients, we considered only those mutations that were in cis to 
an ITD, defined as being on the same single DNA molecule sequence read. These 
mutations at both baseline and relapse are listed in Table 2. 
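
The codon-level comparison can be illustrated with a two-sided Fisher's exact test on a 2 × 2 table of alternative versus native codon counts in the control and patient samples. The sketch below is a generic standard-library implementation, not the authors' code. The counts are invented, and because the exponent of the significance threshold is not legible in this text, the cut-off is left to the caller.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the table [[a, b], [c, d]].

    Rows: control sample, patient sample.
    Columns: alternative codon count, native codon count.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x alternative codons in row 1.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum all tables that are at least as unlikely as the observed one.
    total = sum(
        p for p in (prob(x) for x in range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9)
    )
    return min(1.0, total)


# Hypothetical counts: 0 of 387 control reads and 150 of 450 patient reads
# carry the alternative codon.
p_value = fisher_exact_two_sided(0, 387, 150, 300)
```

A mutation would then be reported only if `p_value` falls below the chosen conservative threshold.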

Molecular docking. Molecular docking was performed using the AutoDock 4.2 
package. The FLT3-ITD structure (residues 587-947) was prepared from the Protein 
Data Bank accession 1RJB"*. All bound waters were removed from the protein. 
Hydrogens were then added to the structure, and partial atomic charges were assigned 
using AutoDockTools (ADT). Residues K644, F830, F691 and E661 were 
selected as flexible residues. To define the flexible residues, we first analysed the 
crystal structure of imatinib-bound inactive KIT kinase domain. The structure of 
FLT3 is quite similar to that of KIT, so this comparison helps us identify potential 
ligand interacting residues in FLT3. In the KIT structure, residues L595, K623, 
E640, L644, T670, Y672, L799 and F811 are close to the ligand. We thus defined 
the corresponding residues in the FLT3 kinase domain (Y693, F830, F691, K644, 
E661, L818, L616, M665, V624) as flexible. Our docking studies revealed that the 
conformations of Y693, L818, L616, M665 and V624 in the docking solutions are 
largely identical to their conformations in the crystal structure, and so we did not 
consider these five residues to be flexible in the final calculations. 

The coordinates of AC220 were generated using the Dundee PRODRG2 
server’, and its initial conformation was energy minimized using the GROMACS 
force field. The Gasteiger charges were then assigned to the ligand using ADT. 
Seven torsion bonds were defined as rotatable during the docking procedure. The 
ligand was put into the kinase ATP-binding pocket and manually aligned to avoid 
atom clashes. A three-dimensional grid box (dimensions: 60 × 30 × 60 in 
number of grid points; grid spacing: 0.375 Å) centred at the ligand and defining the 
search space was then created by AutoGrid4.2 (ref. 32). Two hundred runs of the 
Lamarckian Genetic Algorithm were performed to optimize the ligand-protein 
interactions. The solutions were clustered according to the root mean square 
deviation values, and ranked by the binding free energy. Two general poses were 
observed. The top-ranked pose has an average energy of −10.32 kcal mol⁻¹ (the 
lowest energy for this pose is −10.93 kcal mol⁻¹). The second-ranked pose, which 
is flipped by 180° with respect to the top-ranked position, has 133 solutions (63%) 
with an average energy of −5.82 kcal mol⁻¹ (the lowest energy for this pose is 
−6.98 kcal mol⁻¹). Given the gap in the calculated energy, we only picked the 
lowest one for the purely illustrative purposes of this analysis. 
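
The ranking of docking solutions by pose can be sketched as below. The sketch assumes that each run has already been assigned to a cluster by its root mean square deviation, which is not reimplemented here; the cluster labels and energies are hypothetical and are not the values reported above.

```python
from collections import defaultdict
from statistics import mean


def summarize_poses(runs):
    """Group docking runs by cluster and rank clusters by their lowest energy.

    runs is an iterable of (cluster_label, binding_energy_kcal_per_mol).
    """
    by_cluster = defaultdict(list)
    for label, energy in runs:
        by_cluster[label].append(energy)
    summary = [
        {"cluster": label, "n": len(e), "mean": mean(e), "best": min(e)}
        for label, e in by_cluster.items()
    ]
    # More negative binding free energy ranks first.
    return sorted(summary, key=lambda s: s["best"])


# Hypothetical runs for two poses.
ranked = summarize_poses(
    [("A", -10.0), ("A", -11.0), ("B", -6.0), ("B", -5.0), ("B", -7.0)]
)
```

As in the analysis above, the most populated pose need not be the one with the lowest energy, so both the count and the energy of each cluster are kept.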

Selective cortical representation of attended speaker 
in multi-talker speech perception 


Nima Mesgarani¹ & Edward F. Chang¹ 


Humans possess a remarkable ability to attend to a single speaker’s 
voice in a multi-talker background’ *. How the auditory system 
manages to extract intelligible speech under such acoustically com- 
plex and adverse listening conditions is not known, and, indeed, it 
is not clear how attended speech is internally represented*’. Here, 
using multi-electrode surface recordings from the cortex of sub- 
jects engaged in a listening task with two simultaneous speakers, we 
demonstrate that population responses in non-primary human 
auditory cortex encode critical features of attended speech: speech 
spectrograms reconstructed based on cortical responses to the mix- 
ture of speakers reveal the salient spectral and temporal features of 
the attended speaker, as if subjects were listening to that speaker 
alone. A simple classifier trained solely on examples of single 
speakers can decode both attended words and speaker identity. 
We find that task performance is well predicted by a rapid increase 
in attention-modulated neural selectivity across both single- 
electrode and population-level cortical responses. These findings 
demonstrate that the cortical representation of speech does not 
merely reflect the external acoustic environment, but instead gives 
rise to the perceptual aspects relevant for the listener’s intended 
goal. 

Separating out a speaker of interest from other speakers in a noisy, 
crowded environment is a perceptual feat that we perform routinely. 
The ease with which we hear under these conditions belies the intrinsic 
complexity of this process, known as the cocktail party problem'**: 
concurrent complex sounds, which are completely mixed upon enter- 
ing the ear, are re-segregated and selected from within the auditory 
system. The resulting percept is that we selectively attend to the desired 
speaker while tuning out the others. 

Although previous studies have described neural correlates of mask- 
ing and selective attention to speech**’°, fundamental questions 
remain unanswered regarding the precise nature of speech representa- 
tion at the juncture where competing signals are resolved. In particular, 
when attending to a speaker within a mixture, it is unclear what key 
aspects (for example, spectrotemporal profile, spoken words and 
speaker identity) are represented in the auditory system and how they 
compare to representations of that speaker alone; how rapidly a selective 
neural representation builds up when one attends to a specific speaker; 
and whether breakdowns in these processes can explain distinct per- 
ceptual failures, such as the inability to hear the correct words, or follow 
the intended speaker. 

To answer these questions, we recorded cortical activity from 
human subjects implanted with customized high-density multi- 
electrode arrays as part of their clinical work-up for epilepsy surgery’®. 
Although limited to this clinical setting, these recordings provide 
simultaneous high spatial and temporal resolution while sampling 
the population neural activity from the non-primary auditory speech 
cortex in the posterior superior temporal lobe. We focused our analysis 
on high gamma (75-150 Hz) local field potentials'’, which have been 
found to correlate well with the tuning of multi-unit spike recordings’”. 
In humans, the posterior superior temporal gyrus has been heavily 
implicated in speech perception”’, and is anatomically defined as the 


lateral parabelt auditory cortex (including Brodmann areas 41, 42 
and 22)"*. 

Subjects listened to speech samples from a corpus commonly used 
in multi-talker communication research’*”®. A typical sentence was 
“ready tiger go to red two now” where “tiger” is the call sign, and “red 
two” is the colour-number combination. One male and one female 
speaker were selected, each speaking the same 12 unique combinations 
of two call signs (ringo or tiger), three colours (red, blue or green) and 
three numbers (two, five or seven). Example acoustic spectrograms from 
two individual speakers are shown in Fig. 1a, b. The two voices differ 
along several dimensions including pitch (male versus female), spectral 
profile (different vocal tract shapes) and temporal characteristics 
(speaking rate). Subjects first listened to each of the speakers alone 
and were able to report the colour and number with 100% accuracy. 
Subjects then listened to a monaural, simultaneous mixture of the two 
speakers’ phrases with different call signs, colours and numbers. The 
subjects were instructed to respond by indicating the colour and number 
spoken by the talker who uttered the target call sign. The target call sign 
(ringo or tiger) was fixed and shown visually on a monitor during each 
trial block, which contained 28 different mixture sounds. As the target 
speaker was changed randomly from trial to trial, the subjects were 
required to monitor both voices initially (divided attention) to identify 
the target speaker. The target call sign was switched after each block, 
turning the previous target speaker in each mixture into a masker. This 
resulted in two sets of behavioural and neural responses for each ident- 
ical mixture sound, which differed only in the focus of attention. 
Subjects reported correct responses in 74.8% of trials. 

Figure 1c illustrates the mixture spectrogram and how difficult it is 
to tell which sound parts belong to one speaker versus the other. The 
energy for both speakers is distributed broadly across the spectral and 
temporal domains, with overlap in some areas and isolated sound parts 
in others, as shown in their difference spectrogram (Fig. 1d; average 
spectrograms in Supplementary Fig. 1a). 

To determine the spectrotemporal encoding of the attended 
speaker, the method of stimulus reconstruction was used'’"’ to estimate 
the speech spectrogram represented by the population neural responses. 
Reconstructed spectrograms provide an intuitive way to examine how 
the population neural responses encode the spectrotemporal features of 
speech, and more importantly, can be compared with the original 
acoustic spectrograms as well as across attentional conditions. We first 
calculated the reconstruction filters from a passive listening task using 
a separate continuous speech corpus (TIMIT”’) that consisted of 499 
unique short sentences spoken by 402 different speakers. The filters 
were then fixed and applied to a novel set of population neural res- 
ponses to the single and attended mixture speech for spectrogram 
reconstruction. 

When subjects listened to a single speaker alone, the reconstructed spectrograms 
from population neural activity corresponded well to the spectrotemporal 
features of the original acoustic spectrograms (Fig. 1e, f 
compared to Fig. 1a, b, respectively), exhibiting fairly precise temporal 
features and spectral selectivity (for example, correspondence between 
the high frequency bursts of energy in “tiger” and “two”, in Fig. 1a, b, e, f). 
The average and standard deviation of the correlation between recon- 
structed and original spectrograms over 24 sentences were 
0.60 ± 0.034 (0.60 and 0.62 for the examples in Fig. 1e, f). When 
attending to each of the two speakers, the reconstructed spectrograms 
from the same speech mixture showed a marked difference depending 
upon which speaker was attended (Fig. 1g, h). For each pair, the key 
temporal and spectral features of the target speaker are enhanced rela- 
tive to the masker speaker (Fig. 1g, h compared to Fig. 1e, f, respectively). 
To compare directly, the energy contours from these reconstructed 
spectrograms are overlaid in Fig. 1i. Important spectrotemporal details 
of the attended speaker were extracted, while the masker speech was 
effectively suppressed. 

Attentional modulation of the neural representation was quantified, 
separately for correct and error trials, by measuring the correlation of 
the reconstructed spectrograms from the mixture in two attended 
conditions with original acoustic spectrograms of the speakers alone 
(Fig. 2a–d). During correct trials (Fig. 2a, c), we observed a significant 
shift of average correlation values towards the target speaker repres- 
entation. During error trials, in contrast, no significant shift was 

Figure 1 | Acoustic and neural reconstructed spectrograms for speech from a 
single speaker or a mixture of speakers. a, b, Example acoustic waveform and 
auditory spectrograms of speaker one (male; a) and speaker two (female; b). 
c, Waveform and spectrogram of the mixture of the two shows highly 
overlapping energy distributions. d, Difference spectrogram highlights the 
mixture regions where speaker one (blue) or two (red) has more acoustic 
energy. e, f, Neural-population-based stimulus reconstruction of speaker one 
(e) and speaker two (f) alone shows similar spectrotemporal features as the 
original spectrograms in a and b. g, h, The reconstructed spectrograms from 
the same mixture sound when attending to either speaker one (g) or two (h) 
highly resemble the single speaker reconstructions, shown in e and f, 
respectively. i, Overlay of the spectrogram contours at 50% of maximum 
energy from the reconstructed spectrograms in e, f, g and h. 

observed (Fig. 2b, d). Furthermore, the correlations between the 
reconstructed mixture and the masker speaker were higher than the 
average intrinsic correlation between randomly chosen original 
acoustic speech phrases (Fig. 2c, d, dashed lines), revealing a weak 
presence of the masker speaker in mixture reconstructions, even in 
correct trials. 

The difference in speaking rate of the two speakers, coupled with the 
stereotyped structure of the carrier phrases, results in specific average 
temporal modulation profiles for each speaker (average spectrogram 
for each speaker is shown in Supplementary Fig. 1a, b). To investigate 
encoding of the distinct spectral profile and characteristic temporal 
rhythm of the target compared to the masker speaker, we estimated the 
average difference between reconstructed spectrograms of the two 
speakers, when presented alone and in the attended mixture (Fig. 2e, 
f). The comparison between the two average difference reconstructed 
spectrograms reveals enhanced encoding of both temporal and 
spectral aspects of the attended speaker (Supplementary Fig. 1c, d). 
To study the time course of attention-induced modulation of recon- 
structed mixture spectrograms towards the attended speaker, we 
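calculated an attentional modulation index ($\mathrm{AMI}_{\mathrm{spec}}$), using a sliding 
window of 250 ms throughout the trial duration: 

$$
\begin{aligned}
\mathrm{AMI}_{\mathrm{spec}} ={} & \mathrm{Corr}(\mathrm{SP1}_{\mathrm{spec}}, \mathrm{SP1}_{\mathrm{attend}}) - \mathrm{Corr}(\mathrm{SP1}_{\mathrm{spec}}, \mathrm{SP2}_{\mathrm{attend}}) \\
& + \mathrm{Corr}(\mathrm{SP2}_{\mathrm{spec}}, \mathrm{SP2}_{\mathrm{attend}}) - \mathrm{Corr}(\mathrm{SP2}_{\mathrm{spec}}, \mathrm{SP1}_{\mathrm{attend}})
\end{aligned}
\tag{1}
$$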


where $\mathrm{SP1}_{\mathrm{spec}}$ and $\mathrm{SP2}_{\mathrm{spec}}$ are the original acoustic spectrograms of 
speakers one and two, respectively, and $\mathrm{SP1}_{\mathrm{attend}}$ and $\mathrm{SP2}_{\mathrm{attend}}$ are the 
spectrograms reconstructed from neural responses to the mixture with 
attended targets, speaker one and two, respectively. Positive values of 
this index reflect shifts towards the target, negative values reflect shifts 
to the masker representation, and values around zero reflect no shift 
($\mathrm{AMI}_{\mathrm{spec}} = 0.58$ for the example in Fig. 1). An upper bound for the 
$\mathrm{AMI}_{\mathrm{spec}}$ was calculated by assuming that attention, at best, restores the 
single speaker reconstructions of the target speaker (replacing $\mathrm{SP1}_{\mathrm{attend}}$ 
and $\mathrm{SP2}_{\mathrm{attend}}$ in equation (1) with $\mathrm{SP1}_{\mathrm{alone}}$ and $\mathrm{SP2}_{\mathrm{alone}}$; Fig. 2g, grey 
line). The $\mathrm{AMI}_{\mathrm{spec}}$ from the mixture was first estimated from correct 
trials (Fig. 2g, black line), and could resolve the time point at which the 
reconstructed spectrograms were modulated by attention. After the 
end of the call sign, which cues the speaker that should be attended, a 
rapid positive shift in the $\mathrm{AMI}_{\mathrm{spec}}$ was observed, implying the 
enhanced representation of the target speaker. In error trials, this effect 
shows a bias towards the masker speaker, which, in contrast, occurred 
far earlier in the time course. The neural response shift towards the 
masker, which occurs as early as the call sign, suggests that listeners 
had prematurely attended to the wrong speaker during those error 
trials. 
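
As an illustration of equation (1), the index can be computed in a few lines of Python. This is a minimal sketch and not the analysis code used in the study: the function names, the (frequency × time) array layout and the window length expressed in frames are all assumptions, and a 250 ms window corresponds to 25 frames only if the spectrograms are sampled at 100 Hz.

```python
import numpy as np


def corr(a, b):
    """Pearson correlation between two equally shaped arrays (flattened)."""
    return float(np.corrcoef(a.ravel(), b.ravel())[0, 1])


def ami_spec(sp1_spec, sp2_spec, sp1_attend, sp2_attend):
    """Attentional modulation index as written in equation (1)."""
    return (corr(sp1_spec, sp1_attend) - corr(sp1_spec, sp2_attend)
            + corr(sp2_spec, sp2_attend) - corr(sp2_spec, sp1_attend))


def sliding_ami(sp1_spec, sp2_spec, sp1_attend, sp2_attend, win, step=1):
    """Evaluate the index in successive windows of `win` time frames.

    All inputs are (frequency, time) arrays with the same shape.
    `win` and `step` are in frames (hypothetical parameters).
    """
    n_time = sp1_spec.shape[1]
    values = []
    for start in range(0, n_time - win + 1, step):
        sl = slice(start, start + win)
        values.append(ami_spec(sp1_spec[:, sl], sp2_spec[:, sl],
                               sp1_attend[:, sl], sp2_attend[:, sl]))
    return np.array(values)
```

If the attended reconstructions were identical to the original spectrograms, the index would equal 2 − 2·Corr(SP1_spec, SP2_spec); swapping the two attended reconstructions flips its sign, and feeding the same reconstruction for both attention conditions gives zero.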

Although the reconstruction analyses showed clear attention- 
based spectrotemporal modulation, we wanted to determine explicitly 
whether the attended speech in a mixture could be decoded from a 
model of a single speaker. A regularized linear classifier was trained 
on neural responses to the single speakers and then used to decode 
both the spoken words and speaker identity of the attended speech 
mixture. To keep the chance performance at 50% across all com- 
parisons, classification results were limited only to the choices that 
were present in each mixture. For correct trials, the colour and number 
of the attended speech were decoded with high accuracy (77.2% and 
80.2%, P < 10 × 10⁻⁴, t-test; Fig. 3a). However, the decoding perform- 
ance during error trials was significantly below chance (30.0%, 30.1%, 
P < 10 × 10⁻⁴, t-test; Fig. 3b), indicating a systematic bias towards 
decoding the words of the masker speaker. In addition, for correct 
trials, the call sign was classified at chance performance (Fig. 3a). 
However, for incorrect trials the classifier detected the masker call 
sign significantly more often than the target call sign (34.1%, 
P < 10 × 10⁻⁴, t-test; Fig. 3b), which again shows errors due to an 
early selection of the masker (incorrect) speaker. 
Figure 3 | Decoding spoken words and the identity of the attended speaker. 
a, Classification rate and standard deviation for spoken words (call sign, colour 
and number) of the attended speaker from the neural responses to the 28 
mixtures. Classifiers were trained on single speaker examples only. Colour and 
number of the attended speech are decoded with high accuracy (77.2% and 
80.2%, P < 10 × 10⁻⁴, t-test) in correct trials, but not the call sign (48.0%, not 
significant (NS), t-test). b, In error trials, the classifier showed a systematic bias 
towards the words of the masker speaker (34.1%, 30.0%, 30.1%, P < 10 × 10⁻⁴, 
t-test). c, Attended speaker identification rate and standard deviation in correct 
for target, incorrect (for both target and masker), and correct for masker trials. 
For the speaker identification analyses, we divided the behavioural 
error types into two subsets. The first type occurred when the reported 
colour-number combination was incorrect for either speaker (‘in- 
correct’; 16.5% of trials). The second type occurred when subjects 
reported the correct colour-number for the masker instead of the 
target speaker (‘correct for masker’; 8.6% of trials). 

In correct trials, the classifier identified the target speaker 93.0% of 
the time (P < 10 × 10⁻⁴, t-test; Fig. 3c). During incorrect trials, the 
classifier performance was at chance. However, during correct for 
masker trials, the classifier identified the masker rather than the target 
speaker (27.3%; P < 10 × 10⁻⁴, t-test; Fig. 3c). These classification 
results confirm the observed restoration seen in spectrotemporal 
reconstruction, without necessarily assuming a linear relationship 
between the neural responses and the stimulus. Furthermore, they 
extend recent findings using similar methods to decode speech sounds 
presented in isolation to full words and sentences under complex 
listening conditions. 

We next asked whether the observed robust encoding of attended 
speech results as an emergent property of the distributed population 
activity or is driven by a few spatially discrete sites. The cortical regions 
with reliable evoked responses to speech stimuli were found using a 
t-test between neural responses during speech and silence (P< 0.01), 
and were confined to the posterior superior and middle temporal gyri 
(Fig. 4a). An example of the attentional response modulation at a 
single electrode is shown in Fig. 4b-d. The spectrotemporal receptive 
field (STRF, estimated using the http://www.strflab.berkeley.edu 
package) of this electrode in passive listening to speech (TIMIT) 
showed a strong preference for high frequency sounds (Fig. 4b) 
(STRFs for all electrodes of one subject are provided in Supplemen- 
tary Fig. 2b). This tuning was also evident in the increased neural 
response at this electrode (Fig. 4d, dashed lines) to each of the single 
speakers’ high frequency sound components (circled in Fig. 4c, res- 
ponses are delayed about 120 ms from the stimulus). However, the 
responses to the same speech mixture sound (Fig. 4d, solid lines) were 
significantly modulated by attention. The responses to high frequency 
components were enhanced for the attended speaker, but suppressed 
for similar sounds in the masker speaker (Fig. 4d, solid lines compared 
to dashed lines). This highly modulated yet fixed feature selectivity 
probably contributes to the constancy of the single speaker representa- 
tion observed in our previous analyses. To quantify this effect for each 
individual electrode, we measured the correlation between the neural 
responses to the attended mixture and to those of the speakers in 
isolation ($\mathrm{AMI}_{\mathrm{elec}}$, equation (2) in Methods). We found a varying 
degree of bias towards the attended speaker distributed across the 
population (Supplementary Fig. 3d; $\mathrm{AMI}_{\mathrm{elec}} = 0.28$ for the example 
Figure 4 | Attentional modulation of individual electrode sites. a, Electrodes 
picking up a significant difference between responses to silence and speech 
sounds (P < 0.01, t-test). b, STRF of this representative electrode site shows a 
preference for high frequency sounds. c, Mixture difference spectrogram for a 
selected duration containing a high frequency component for each speaker 
(circled). d, The electrode shows an increased response to high frequency 
sounds of single speakers (dashed lines, peak neural response is delayed by 
about 120 ms). However, the neural response to the same mixture sound in two 
attention conditions (solid lines) showed an enhanced response to high 
frequency sounds only for the target, but with responses for similar sounds in 
the masker speaker suppressed. 
in Fig. 4), which gradually builds up after the end of the call sign 
(Supplementary Fig. 3e). We did not observe any particular anatomical 
pattern for the attentional modulation across sites (Supplementary 
Fig. 3f). Rather, it appeared to be distributed over responsive sites, 
consistent with previous findings of higher-order sound processing. 

In summary, we demonstrate that the human auditory system 
restores the representation of the attended speaker while suppressing 
irrelevant competing speech. Speech restoration occurs at a level where 
neural responses still show precise phase-locking to spectrotemporal 
features of speech. Population responses revealed the emergent rep- 
resentation of speech extracted from a mixture, including the 
moment-by-moment allocation of attentional focus. 

These results have implications for models of auditory scene ana- 
lysis. In agreement with recent studies, the cortical representation of 
speech in the posterior temporal lobe does not merely reflect the 
acoustical properties of the stimulus, but instead relates strongly to 
the perceived aspects of speech. Although the exact mechanisms are 
not fully known, multiple processes in addition to attention are likely 
to enable this high-order auditory processing, including grouping of 
predictable regularities in speech acoustics, feature binding and 
phonemic restoration. Conversely, behavioural errors seem to result 
from degradation of the neural representation, a direct result of inherent 
sensory interference such as energetic masking (Supplementary 
Fig. 3g, h) and/or the allocation of attention. 

In speech, the end result represented in the posterior temporal lobe 
appears to be unaffected by perceptually irrelevant sounds, which is 
ideal for subsequent linguistic and cognitive processing. Following one 
speaker in the presence of another can be trivial for a normal human 
listener, but remains a major challenge for state-of-the-art automatic 
speech recognition algorithms. Understanding how the brain solves 
this problem may inspire more efficient and generalizable solutions 
than current engineering approaches. It will also shed light on how 
these processes become impaired during ageing and in disorders of 
speech perception in real-world hearing conditions. 


METHODS SUMMARY 


Three human subjects with normal hearing underwent the placement of a 
subdural electrode array as part of their clinical treatment for epilepsy. We used 
speech samples from a publicly available database called Coordinate Response 
Measure (CRM). One male and one female speaker were selected with two call 
signs (ringo and tiger), three colours (red, blue or green) and three numbers (two, 
five or seven). We generated 12 unique combinations of call sign, colour and 
number per speaker (total of 24 single speaker phrases) and 28 mixture speech 
samples by selecting from combinations of the 24 single speaker sentences (0 dB 
target-to-masker ratio). Speech sounds were presented monaurally from a 
loudspeaker. We used stimulus reconstruction to map the population electro- 
corticographic response to the spectrogram of the speech stimulus. 
Reconstruction filters were estimated from neural responses to a separate speech 
corpus (TIMIT). Test speakers were not used in the estimation of filters. For 
word and speaker decoding analysis, a regularized linear classifier was trained on 
neural responses of the single speakers and then used to decode the spoken words 
and speaker identity of the attended speech mixture. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


The experimental protocol was approved by the Committee for Human Research 
at the University of California, San Francisco. 

Subjects. Three human subjects underwent the placement of a high-density 
subdural electrode array (4 mm pitch) over the language-dominant hemisphere 
as part of routine clinical treatment for epilepsy. Subjects gave their written 
informed consent before surgery. All subjects had self-reported normal hearing 
and underwent neuropsychological language testing (including the Boston nam- 
ing and verbal fluency tests) and were found to be normal. The intracarotid 
sodium amobarbital (Wada) test was used for language dominance assessment. 
The electrodes in the study were located over the posterior dorsolateral temporal 
lobe. The location and corresponding spectrotemporal receptive fields of all the 
included electrodes for a subject are shown in Supplementary Fig. 2. 

Data acquisition and pre-processing. The electrocorticography signal was 
recorded with a multichannel amplifier optically connected to a digital signal 
processor (Tucker-Davis Technologies). Each channel time series was visually 
and quantitatively inspected for artefacts or excessive noise. The data were then 
segmented with a 100 ms pre-stimulus baseline and a 400 ms post-stimulus interval. 
The common mode signal was estimated using principal component analysis with 
channels as repetitions and was removed from each channel time series using vector 
projection. 
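
One way to read the common-mode step is sketched below in Python. It is an illustration under stated assumptions, not the authors' pipeline: the dominant time course shared across channels is taken as the first right singular vector of the (channels × time) matrix, and it is removed from every channel by vector projection. The function name and the choice to subtract each channel's own temporal mean first are hypothetical.

```python
import numpy as np


def remove_common_mode(data):
    """Project the dominant shared time course out of every channel.

    data: array of shape (channels, time).
    Returns an array of the same shape.
    """
    # Remove each channel's own mean over time (assumed preprocessing choice).
    centered = data - data.mean(axis=1, keepdims=True)
    # Rows of vt are unit-norm time courses ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    common = vt[0]
    # Per-channel projection coefficient onto the common time course.
    coeff = centered @ common
    return centered - np.outer(coeff, common)
```

After this step every channel is orthogonal to the estimated common time course, so a signal that appears on all electrodes with different gains is suppressed while channel-specific activity is retained.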

Task design and behavioural testing. We used speech samples from a publicly 
available database called Coordinate Response Measure (CRM) containing 
sentences in the form “ready (call sign) go to (colour) (number) now”. One male 
and one female speaker (speakers one and five in CRM corpus) were selected with 
two call signs (ringo and tiger), three colours (blue (B), red (R) or green (G)) and three 
numbers (two, five or seven). For each of the two call signs, we generated six colour- 
number combinations (B2, B5, R2, R7, G5, G7), resulting in 12 different phrases. We 
chose the same phrases for each of the two speakers, resulting in 24 single speaker 
sentences. We then produced 28 unique mixture speech samples by selecting from 
combinations of the 24 single speaker sentences at 0 dB target-to-masker ratio. Each 
mixture sample was chosen such that there was no overlap between call signs, colours 
or the numbers of the two phrases. In addition, each speaker had the same number of 
call signs (ringo or tiger) in each trial block. The sounds were presented monaurally 
from a loudspeaker connected to a laptop, which was also used to collect subjects’ 
responses through a customized graphical user interface. Each trial block consisted of 
28 trials and the target call sign was fixed for each block. The target call sign was 
displayed visually before and during the trial block. Subjects first listened to each of 
the speakers alone and were able to report the colour and number with 100% 
accuracy. Subjects then listened to a monaural, simultaneous mixture of the two 
speakers’ phrases with different call signs, colours and numbers. The subjects were 
instructed to respond by indicating the colour and number spoken by the talker who 
uttered the target call sign. The target speaker changed from trial to trial pseudo- 
randomly, requiring the subjects to initially monitor both speakers until they detect 
the target call sign. After each trial block, the target call sign was changed, switching 
the role of target and masker speakers in each mixture sound. 
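
The stimulus constraints described above can be checked by enumeration. The following Python sketch lists the 12 phrases available to each speaker and every pairing of a speaker-one phrase with a speaker-two phrase in which call sign, colour and number all differ. The function and constant names are hypothetical, and how the 28 mixtures used in the study were drawn from these candidates is not specified in the text.

```python
from itertools import product

CALL_SIGNS = ("ringo", "tiger")
# The six colour-number combinations listed in the text (B2, B5, R2, R7, G5, G7).
COLOUR_NUMBER = (("blue", 2), ("blue", 5), ("red", 2),
                 ("red", 7), ("green", 5), ("green", 7))


def single_speaker_phrases():
    """All (call sign, colour, number) phrases for one speaker."""
    return [(call, colour, number)
            for call, (colour, number) in product(CALL_SIGNS, COLOUR_NUMBER)]


def valid_mixtures():
    """Pairs (speaker-one phrase, speaker-two phrase) with no shared
    call sign, colour or number."""
    phrases = single_speaker_phrases()
    return [(p1, p2) for p1, p2 in product(phrases, phrases)
            if all(a != b for a, b in zip(p1, p2))]
```

Under these constraints there are 36 candidate pairings, so the 28 mixtures reported here are a subset of them.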

Electrode selection. The cortical sites on the superior and middle temporal gyri 
with reliable evoked responses to speech stimuli were selected for all the 
subsequent analysis. Our inclusion criteria consisted of a t-test between responses 
to randomly selected time frames during passive speech presentation (TIMIT) 
and in silence (P < 0.01, resulting in 83, 92 and 102 electrodes for subjects one to 
three. One example subject is shown in Supplementary Fig. 2a). Solely for 
visualization, we also estimated the STRFs of these selected sites from passive 
listening to TIMIT using the normalized reverse correlation algorithm (STRFLab 
software package, http://www.strflab.berkeley.edu; Supplementary Fig. 2b). 
Correlation histogram of STRF predictions for all 275 electrode sites is shown 
in Supplementary Fig. 1c. 
Stimulus reconstruction. We used stimulus reconstruction to map the popu- 
lation neural responses to the spectrogram of the speech stimulus. 
Reconstruction filters were estimated from neural responses to a separate speech 
corpus (TIMIT) containing a total of 499 unique short sentences from 402 
different speakers. Filters were obtained using normalized reverse correlation to 
minimize the mean squared error of the reconstructed spectrograms with filter 
time lags from −420 to 0 ms (causal filters). The filters were then fixed in all 
subsequent conditions and were applied to the neural responses to CRM samples. 
None of the speakers or phrases in the CRM data set were used in estimation of 
the filters. The output of the reconstruction algorithm was further processed with a 
band-pass filter applied to each frequency channel of reconstructed spectrograms 
to remove the baseline. All the processing steps for stimulus reconstruction were 
identical in all conditions (single and mixture speakers). 
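
The general shape of a linear stimulus-reconstruction model can be sketched in Python as follows. This is not the normalized reverse correlation procedure used here: the sketch substitutes plain ridge-regularized least squares, and the function names, the ridge parameter and the lag convention (each stimulus frame is predicted from the neural responses in the `n_lags` frames that follow it) are assumptions made for illustration.

```python
import numpy as np


def lagged_design(resp, n_lags):
    """Stack responses at lags 0..n_lags-1 after each stimulus frame.

    resp: (time, electrodes) -> (time, electrodes * n_lags).
    Rows near the end are zero-padded where no later response exists.
    """
    n_time, n_elec = resp.shape
    design = np.zeros((n_time, n_elec * n_lags))
    for lag in range(n_lags):
        design[:n_time - lag, lag * n_elec:(lag + 1) * n_elec] = resp[lag:]
    return design


def fit_filters(resp, spec, n_lags, ridge=1.0):
    """Least-squares filters mapping lagged responses to a (time, freq) spectrogram."""
    design = lagged_design(resp, n_lags)
    gram = design.T @ design + ridge * np.eye(design.shape[1])
    return np.linalg.solve(gram, design.T @ spec)


def reconstruct(resp, filters, n_lags):
    """Apply fixed filters to new responses to obtain a reconstructed spectrogram."""
    return lagged_design(resp, n_lags) @ filters
```

Mirroring the procedure in the text, `fit_filters` would be run once on responses to a separate corpus and the resulting filters then applied unchanged, through `reconstruct`, to responses recorded in every other condition.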
AMI. To quantify the change in similarity between the representation of single and 
attended speaker in mixture speech, we defined the $\mathrm{AMI}_{\mathrm{spec}}$ in equation (1). The 
stereotypical format of the CRM phrases results in an intrinsic correlation between 
the neural responses to different sentences, particularly at the beginning (“ready”) 
and middle of the carrier phrase (“go to”), which results in reduced possible 
$\mathrm{AMI}_{\mathrm{spec}}$ values for these segments. To estimate an upper bound for unbiased 
comparison, $\mathrm{AMI}_{\mathrm{spec}}$ was calculated where the representation of an attended 
speaker in a mixture is ideally assumed to be identical to the representation of 
that speaker when presented alone; therefore, replacing $\mathrm{SP}_{\mathrm{attend}}$ in equation (1) 
with the reconstructed spectrogram of single speaker $\mathrm{SP}_{\mathrm{alone}}$. The upper bound 
peaks at the call sign, colour and number where different phrases are most 
dissimilar. The overall increase in the upper bound is due to the progressive 
asynchrony between the two speakers. 

The same statistics can be used to estimate the AMI of an individual electrode 
site by calculating the correlation values between the neural response of that site to 
attended mixture and single speaker presentations: 


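$$
\begin{aligned}
\mathrm{AMI}_{\mathrm{elec}} ={} & \mathrm{Corr}(\text{R-SP1}_{\mathrm{alone}}, \text{R-SP1}_{\mathrm{attend}}) - \mathrm{Corr}(\text{R-SP1}_{\mathrm{alone}}, \text{R-SP2}_{\mathrm{attend}}) \\
& + \mathrm{Corr}(\text{R-SP2}_{\mathrm{alone}}, \text{R-SP2}_{\mathrm{attend}}) - \mathrm{Corr}(\text{R-SP2}_{\mathrm{alone}}, \text{R-SP1}_{\mathrm{attend}})
\end{aligned}
\tag{2}
$$

where $\text{R-SP1}_{\mathrm{alone}}$ and $\text{R-SP2}_{\mathrm{alone}}$ are the responses of an electrode to speakers one 
and two alone, respectively, and $\text{R-SP1}_{\mathrm{attend}}$ and $\text{R-SP2}_{\mathrm{attend}}$ are the responses of 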
the same electrode to the mixture of the two when the attended target is speaker 
one and two, respectively. 

Classification of spoken words and speaker identity. A linear-frame-based 
regularized-least-square classifier was used to investigate the discriminability 
of the spoken words and speaker identity from electrocorticographic responses. 
Two binary classifiers were trained to classify the call sign and speaker identity, 
and two separate three-way classifiers were used for colour and for number 
classification. Classifiers were trained only on the neural responses of single speak- 
ers (24 sentences) and tested on the mixtures. The classifiers produced a linear 
weighted sum of the neural responses at each time instance and the classifier that 
produced the maximum average output over the duration of words was chosen as 
classification result. The classifier decision was limited to only the colours and 
numbers that occurred in each mixture, therefore resulting in the same 50% chance 
performance in all cases. 
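
The decision rule described above can be illustrated with a short Python sketch. It is a generic regularized least-squares classifier with one-hot targets, not the specific implementation cited here; the function names, the bias column and the ridge value are assumptions. The key point it shows is the final step: frame-wise linear outputs are averaged over the word, and the winner is chosen only among the classes that actually occur in the mixture.

```python
import numpy as np


def train_rls(frames, labels, n_classes, ridge=1.0):
    """Fit frame-wise regularized least-squares weights.

    frames: (n_frames, n_features); labels: (n_frames,) integer classes.
    Returns weights of shape (n_features + 1, n_classes), bias in the last row.
    """
    X = np.hstack([frames, np.ones((len(frames), 1))])
    Y = np.eye(n_classes)[labels]          # one-hot targets
    gram = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(gram, X.T @ Y)


def decode(frames, weights, candidates):
    """Average the linear outputs over all frames of a word and pick the
    best class among `candidates` (the classes present in the mixture)."""
    X = np.hstack([frames, np.ones((len(frames), 1))])
    mean_output = (X @ weights).mean(axis=0)
    return max(candidates, key=lambda c: mean_output[c])
```

Restricting `candidates` to two classes per decision is what fixes chance performance at 50%, even for the three-way colour and number classifiers.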
SOLID-STATE PHYSICS 


Electrons do the split 


Interacting electrons that are confined to move in a one-dimensional structure do not simply jam together like cars in rush 
hour. Inelastic X-ray scattering shows that the electrons act as if they split into separate fractional entities. 


RALPH CLAESSEN 


explained by reducing it to its simplest 

version. For example, to understand the 
quantum-mechanical motion of non-inter- 
acting electrons in a crystalline solid, the case 
of a one-dimensional crystal lattice suffices 
to introduce the concept of electronic band 
structure, which describes the range of ener- 
gies that the electrons may have in the solid. 
But when it comes to interacting electrons, this 
approach fails. Coulomb repulsion between any 
two electrons is much stronger in one-dimen- 
sional solids than in their higher-dimensional 
counterparts, and many-body effects emerge 
that lead to an apparent fractionalization of the 
electron. With this fractionalization, the elec- 
tron’s spin and charge seem to form separate 
quasiparticles — spinons and holons, respec- 
tively — that move independently of each 
other and have different velocities. In a paper 
published on Nature’s website today, Schlappa 
et al. describe an experiment that takes the idea 
of electronic fractionalization a step further by 
revealing that electrons can split into a third 
form of quasiparticle, the orbiton. 

Schlappa and colleagues observe orbitons 
in a copper oxide (Sr₂CuO₃) containing a one- 
dimensional chain of copper oxide building 
blocks (CuO₃), in which the valence electrons 
reside in the 3d electronic shell of the central 
copper atoms (Fig. 1). In a free atom there are 
five 3d orbitals, which are energetically degen- 
erate — that is, they all have the same energy. 
However, in a CuO₃ unit, the electrical field 
of the oxygen ions around each copper atom 
lifts this degeneracy, generating an intra-shell 
energy spectrum. In the chain's ground state, 
the 3d electrons in all of the CuO₃ units are in 
the same, lowest-energy 3d state, with their 
spin orientation alternating between neigh- 
bouring units as a result of antiferromagnetic 
interactions. 

In such a system, electrons can be locally 
excited between the energetically different 
3d states, leaving one of the CuO₃ units in a 
state known as crystal-field excitation. To 
achieve this excitation, Schlappa et al. shone 
X-rays on the CuO₃ chain using a technique 
called resonant inelastic X-ray scattering. 
The method is similar to conventional optical 
Figure 1 | Electron fractionalization in a one-dimensional structure. a, Schlappa and colleagues’ system 
consists of a chain of copper oxide units (CuO₃). In the ground state, all units contain 3d electrons in the 
same in-plane orbital, and the direction of the local 3d spins (arrows) alternates between neighbouring 
units. Shining X-ray photons on the chain excites the electrons in one of the units into an out-of-plane 
orbital of higher energy. b, The authors show that the X-ray irradiation leads to an immediate splitting of 
the excitation into two separate quasiparticles of different velocities: a spinon, which is associated with a 
local perturbation in the spin arrangement, and an orbiton, which carries the orbital excitation. 


Raman scattering, but owing to the short wave- 
lengths of X-rays, it provides information 
not only about a sample's electronic excita- 
tions, but also about the excitations’ spatial 
dynamics. 

Traditionally, crystal-field excitations in 
3d-metal compounds have been viewed as 
localized objects that stay fixed to the atomic 
site at which they were generated. But Schlappa 
and colleagues provide clear evidence that, in 
one dimension, such entities break up into 
two independent, mobile fragments immedi- 
ately after excitation. These fragments are the 
spinon, which is a local disturbance of the spin 
arrangement of the electron ensemble, and the 
orbiton, a collective response of all 3d electrons 
in the chain, which carries the 3d intra-shell 
excitation. The authors find that the separa- 
tion of the spinon and the orbiton and their 
ensuing dynamical behaviour are in excellent 
agreement with theoretical expectations. 

The concept of orbitons as collective orbital 
excitations in 3d-metal compounds, which is 
similar to that of spin waves in magnetically 
ordered systems, was introduced more than 
a decade ago. However, for a long time their 
propagating character escaped unambiguous 
experimental verification, because previous 
work had been directed at materials of higher 
dimensions than the one-dimensional system 
studied here. In such systems, intricate cou- 
pling of the initial intra-shell 3d excitation 
to magnetic excitations tends to immobilize 
the orbitons, preventing their observation 
as propagating entities. The spinon-orbiton 
separation reported by Schlappa et al. not only 
establishes a new aspect of one-dimensional 
electron fractionalization, but also provides 
the first clear observation of moving orbitons 
in a solid. 

It should be noted that this observation has 
become possible only through the enormous 
advances made in resonant inelastic X-ray 
scattering during the past few years, notably 
through pioneering work at the Swiss Light 
Source facility at the Paul Scherrer Institute 
in Villigen, Switzerland, where the authors’ 
experiments were conducted. Schlappa and 
colleagues’ results thus highlight the increas- 
ingly crucial role of resonant inelastic X-ray 
scattering in the study of electronic excita- 
tions, just as inelastic neutron scattering is the 
method of choice in the study of magnetic and 
lattice excitations. 

Finally, are there any practical implications 
for spinon-orbiton separation? As the micro- 
electronics industry moves towards ever- 
increasing miniaturization, electronic devices 
and the conducting connections between them 
will eventually reach quantum-mechanical 
limits. When the diameter of the connections 
reaches atomic dimensions, the transport of 
charge in them — and also of spin and heat — 
will no longer follow the laws for macroscopic 
(three-dimensional) conductors, and one- 
dimensional electron fractionalization will 
become relevant. Whether such fractionaliza- 
tion can be used for new device functionalities 
in quantum computing or spin electronics 
remains to be seen. 
In vivo reprogramming of murine cardiac 
fibroblasts into induced cardiomyocytes 


Li Qian, Yu Huang, C. Ian Spencer, Amy Foley, Vasanth Vedantham, Lei Liu, Simon J. Conway, 
Ji-dong Fu & Deepak Srivastava 


The reprogramming of adult cells into pluripotent cells or directly into alternative adult cell types holds great promise for 
regenerative medicine. We reported previously that cardiac fibroblasts, which represent 50% of the cells in the mammalian 
heart, can be directly reprogrammed to adult cardiomyocyte-like cells in vitro by the addition of Gata4, Mef2c and Tbx5 
(GMT). Here we use genetic lineage tracing to show that resident non-myocytes in the murine heart can be reprogrammed 
into cardiomyocyte-like cells in vivo by local delivery of GMT after coronary ligation. Induced cardiomyocytes became 
binucleate, assembled sarcomeres and had cardiomyocyte-like gene expression. Analysis of single cells revealed 
ventricular cardiomyocyte-like action potentials, beating upon electrical stimulation, and evidence of electrical 
coupling. In vivo delivery of GMT decreased infarct size and modestly attenuated cardiac dysfunction up to 3 months 
after coronary ligation. Delivery of the pro-angiogenic and fibroblast-activating peptide, thymosin β4, along with GMT, 
resulted in further improvements in scar area and cardiac function. These findings demonstrate that cardiac fibroblasts can 
be reprogrammed into cardiomyocyte-like cells in their native environment for potential regenerative purposes. 


Heart failure affects over 14 million people worldwide and is a leading 
cause of death in adults and in children. Because postnatal cardio- 
myocytes (CMs) have little or no regenerative capacity, therapies are 
limited at present. The introduction of exogenous stem-cell-derived 
CMs holds promise, but also challenges, including delivery, integration,
rejection and cellular maturation. Reprogramming adult
fibroblasts into induced pluripotent stem cells (iPSCs) that are similar
to embryonic stem cells addresses some issues, but others, including
efficient directed differentiation into CMs and effective delivery, 
remain. 

A new generation of reprogramming technology involves transdifferentiating
one adult somatic cell type directly into another.
We reported direct reprogramming of fibroblasts into CM-like cells
in vitro by expressing three transcription factors: Gata4, Mef2c and
Tbx5 (GMT). As observed in reprogramming to iPSCs, the percentage
of fibroblast cells fully reprogrammed to beating CMs in vitro was
small, but far more were partially reprogrammed, much like pre-iPSCs
that can become fully pluripotent with additional stimuli.
We posited that cardiac fibroblasts may reprogram more fully in vivo 
in their native environment, which might promote survival, matura- 
tion, and coupling with neighbouring cells. If so, the vast pool of 
cardiac fibroblasts in the heart could serve as an endogenous source 
of new CMs for regenerative therapy. 


Retroviral delivery of GMT in vivo 


We used a retroviral system to express GMT, and/or dsRed as a 
marker, in the hearts of 2-month-old male mice by direct intramyo- 
cardial injection. After 2 days, transverse sections of the injected area 
were prepared and co-stained for dsRed, α-actinin (a CM marker) and
vimentin (enriched in fibroblasts). No markers are uniquely specific
for cardiac fibroblasts, but fibroblasts are known to express vimentin
and the surface markers Thy1 and DDR2 (ref. 13). At baseline, it was
difficult to detect α-actinin- or vimentin-positive cells that also
expressed dsRed, suggesting minimal viral uptake, and consistent
with the observation that retroviruses only infect actively dividing
cells.

Fibroblasts are embryologically distinct from CMs in their origin,
and following myocardial infarction (MI) become activated, migrate
to the injury site, and proliferate. We induced cardiac injury by
coronary artery ligation and injected dsRed retrovirus into the
myocardium bordering the infarct zone. Whereas cells co-expressing
dsRed and α-actinin were still undetectable, many vimentin-positive
cells were also positive for dsRed (Supplementary Fig. 1). By
fluorescence-activated cell sorting (FACS), over 4% of cells (98,238 ± 5,523)
from the left ventricle of injected hearts were dsRed⁺ Thy1⁺ 2 days
after injury, suggesting successful delivery of virus into cardiac
fibroblasts and possibly other non-myocytes upon injury (Fig. 1a, b).
By quantitative polymerase chain reaction (qPCR), dsRed⁺ Thy1⁺-sorted
cells expressed about 60-fold more GMT than dsRed⁻ Thy1⁺
cells, and 6-8-fold more than endogenous CMs (Fig. 1c). A similar
number of dsRed⁺ Thy1⁻ cells represented other non-myocyte cell
types. Endothelial cells (PECAM⁺) and some perivascular cells
(NG2⁺; also known as Cspg4) were also transduced by the retrovirus,
but haematopoietic (CD34⁺) and pericardial (WT1⁺) cells were not
(Supplementary Fig. 1).
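
As an illustration of how a fold difference such as the roughly 60-fold enrichment above can be derived from qPCR data, the following minimal Python sketch applies the widely used comparative Ct (2^-ΔΔCt) calculation. The text does not state which quantification method or reference gene was used, so the choice of method, the function name `fold_change` and all Ct values below are assumptions made purely for illustration.

```python
def fold_change(ct_target_sample, ct_ref_sample, ct_target_control, ct_ref_control):
    """Relative expression by the comparative Ct (2^-ddCt) method.

    Each argument is a threshold-cycle (Ct) value. The 'ref' values belong to a
    reference (housekeeping) gene measured in the same sample.
    """
    d_ct_sample = ct_target_sample - ct_ref_sample      # normalise sample to reference gene
    d_ct_control = ct_target_control - ct_ref_control   # normalise control to reference gene
    dd_ct = d_ct_sample - d_ct_control                   # difference between sample and control
    return 2.0 ** (-dd_ct)                               # assumes ~100% amplification efficiency


# Hypothetical Ct values, not taken from the paper.
example = fold_change(ct_target_sample=22.0, ct_ref_sample=18.0,
                      ct_target_control=28.0, ct_ref_control=18.0)
print(f"fold change: {example:.1f}")
```

A target transcript that crosses threshold six cycles earlier in the sorted cells than in the control, after normalisation, corresponds to a 64-fold difference under the assumption of perfect doubling per cycle.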


Reprogramming into induced cardiomyocytes 


To determine whether new cardiomyocytes could be created in vivo 
from cells other than post-mitotic CMs, we used lineage-tracing 
experiments to track the origin of putative induced cardiomyocytes 
(iCMs). To label cells genetically, we used a mouse transgenic line that
expresses Cre recombinase under the promoter of the fibroblast-enriched
gene, periostin. When intercrossed with the R26R-lacZ
reporter line, in which β-galactosidase is activated only in
periostin-Cre-expressing cells and their progeny (Fig. 1d-f), we found
β-galactosidase activity in many, but not all, cardiac fibroblasts and


1Gladstone Institute of Cardiovascular Disease, San Francisco, California 94158, USA. 2Department of Pediatrics, University of California, San Francisco, California 94158, USA. 3Department of
Biochemistry and Biophysics, University of California, San Francisco, California 94158, USA. 4Cardiovascular Research Institute, University of California, San Francisco, California 94158, USA. 5Department
of Medicine, University of California, San Francisco, California 94158, USA. 6Developmental Biology and Neonatal Medicine Research Program, Indiana University School of Medicine, Indianapolis, Indiana
Figure 1 | Genetic lineage tracing demonstrates in vivo reprogramming of
cardiac fibroblasts to CM-like cells. a, Quantification of FACS analyses for
Thy1⁺ cells from sham-operated mouse hearts or hearts 2 days after MI (n = 3,
*P < 0.05). b, FACS analyses of Thy1⁺ dsRed⁺ cells from sham-operated or
post-MI hearts injected with dsRed-expressing retrovirus, with quantification
(left) and representative FACS plots (right) (n = 3, *P < 0.05). APC,
allophycocyanin. c, qPCR analysis of Gata4, Mef2c and Tbx5 in Thy1⁺ dsRed⁺
cells or endogenous CMs compared to Thy1⁺ dsRed⁻ cells sorted 2 days after
post-MI intramyocardial Gata4, Mef2c, Tbx5 and dsRed (GMTR) injection.
n = 3 with technical quadruplicates. d, Schematic diagram showing the genetic
fate mapping method to trace the lineage of CMs reprogrammed from
periostin-Cre:R26R-lacZ or Fsp1-Cre:R26R-lacZ cells. βGal, β-galactosidase;
Postn, periostin. e, Immunofluorescent staining for α-actinin (green) and
β-galactosidase (red) on sham-operated or post-MI periostin-Cre:R26R-lacZ
mouse hearts 4 weeks post-surgery. Images are from distant or border zones
where endogenous CMs were labelled by α-actinin, but were never co-localized
with β-galactosidase (n = 5 hearts per condition, 8 sections per heart). Scale
bar, 50 µm. f, g, Immunofluorescent staining for α-actinin, β-galactosidase and
DAPI in infarct areas of dsRed- or GMT-injected periostin-Cre:R26R-lacZ
(f) or Fsp1-Cre:R26R-lacZ (g) mouse hearts 4 weeks after MI. Boxed areas
indicate regions of magnification. Scale bar, 50 µm. Error bars indicate standard
error of the mean (s.e.m.).


some endocardial and endothelial cells, as previously reported.
Periostin-Cre activity was absent in bone marrow cells (not shown).
β-Galactosidase activity was not detected in any cardiomyocytes, even
4 weeks after injury, consistent with thoracic aortic banding studies,
confirming that the periostin-Cre mice mark only descendants of the
non-myocyte population, even after MI (Fig. 1e). Isolation of
single CMs from these hearts confirmed the absence of β-galactosidase
activity in over 1,500 CMs per heart from 6 mice.

In contrast, 4 weeks after MI and retroviral delivery of GMT,
numerous β-galactosidase⁺ cells were α-actinin⁺ in the injured areas,
with well-formed sarcomeres and shapes similar to β-galactosidase⁻
myocytes, suggesting that they were descendants of cells that once
expressed periostin (Fig. 1f). Similar results were obtained using
transgenic mice in which Cre recombinase was under the control of
the fibroblast-specific protein 1 (Fsp1; also known as S100a4)
promoter to label the non-myocyte population (Fig. 1g and
Supplementary Fig. 2a-f). Thus, analogous to induction of cardiac
fibroblasts into skeletal muscle in vivo upon introduction of the skeletal
muscle master regulator, MyoD, GMT appeared to induce the
formation of cardiomyocytes in vivo. We tested whether endothelial
and circulating haematopoietic cells marked by Tie2-Cre:R26R-lacZ
transgenic mice could be reprogrammed to express sarcomeric markers,
but found no evidence for such an event (Supplementary Fig. 2g-i).

We formally tested whether retroviral introduction of GMT into
non-myocytes could promote cell fusion events in the heart, thereby
generating α-actinin⁺ β-galactosidase⁺ cells. We ‘pulse-labelled’
endogenous CMs in transgenic mice with Cre under inducible control of the
α-MHC (also known as Myh6) promoter (α-MHC-MerCreMer)
crossed with R26R-EYFP mice (Supplementary Fig. 3). Subsequently,
hearts were injured and infected retrovirally with GMT and dsRed to
mark infected dividing cells. After 4 weeks, we detected no YFP⁺ cells
co-labelled with dsRed in the GMTdsRed- or dsRed-infected hearts
(Fig. 2a, b and Supplementary Fig. 4a). Because pulse labelling marks
only ~80% of endogenous CMs in the uninjured heart and ~60% in the
infarct border zone, we quantified the percentage of YFP⁺ pulse-labelled
endogenous CMs at the border area. GMT introduction
resulted in a reduced percentage of YFP⁺ endogenous CMs compared
to total CMs, indicating that the CMs in this region were refreshed by
new iCMs (Fig. 2c). These findings suggest that it is unlikely that cell
fusion makes a major contribution to the α-actinin⁺ β-galactosidase⁺
cell population, although a minor contribution cannot be ruled out.

If α-actinin⁺ β-galactosidase⁺ cells instead resulted from cellular
reprogramming, one might detect progressive stages of reprogramming
over time, as cell fusion would yield mature cells soon after fusion
without intermediate stages. We therefore analysed heart sections 1, 2,
3 and 4 weeks after injury and GMT infection, and classified cells into
four groups based on increasing α-actinin expression and organization
into sarcomeres. The number of α-actinin⁺ β-galactosidase⁺ cells in
the infarct area increased temporally, as did the maturity of the cells,
with progressive increases in the percentage of cells with
well-developed sarcomeres (Supplementary Fig. 5).

To avoid false positives from overlaying cells due to the thickness of
the heart sections, we isolated adult CMs at the single-cell level from
the infarct/border zone of periostin-Cre:R26R-lacZ reprogrammed
hearts 4 weeks after coronary ligation (Supplementary Fig. 6a). In this
preparation, non-myocytes were removed, and cells were assayed
2-4 h after primary culture. No CMs isolated from dsRed-injected
hearts were β-galactosidase⁺ by immunostaining (Fig. 2d). Similarly,
CMs from periostin-Cre:R26R-EYFP mice were all YFP⁻, among the
thousands of cells visualized, in agreement with the absence of
periostin-Cre activity in myocytes after injury. In contrast, 35% of cells in the
CM preparation from the border/infarct zone were β-galactosidase⁺
after GMT injection (Fig. 2e, f and Supplementary Fig. 6b). Among
the β-galactosidase⁺ cells, 98% were also α-actinin⁺ (Supplementary
Fig. 7a-d). Furthermore, in hearts co-injected with GMT and dsRed
retrovirus, β-galactosidase⁺ CMs were also positive for dsRed,
indicating retroviral infection and their likely origin from non-post-mitotic
CMs (Fig. 2g-k).

Most β-galactosidase⁺ cells were large, rod-shaped and binucleated,
closely resembling endogenous CMs that were β-galactosidase⁻ from
the same preparation. In addition to α-actinin, β-galactosidase⁺ cells
expressed multiple sarcomeric markers, including tropomyosin
(Fig. 2m), α-MHC (Fig. 2n), and cardiac troponin T (cTnT; also known
as Tnnt2) (Fig. 2o). Half of the cells had nearly normal sarcomeric
structures throughout the cell. The full spectrum of reprogrammed
cells, classified by quality of sarcomeric structure, is shown in
Supplementary Fig. 7. Characterization of single dsRed⁺ YFP⁻ cells
derived from the α-MHC-Mer-Cre-Mer-YFP pulse-labelled reprogrammed
hearts revealed good sarcomere formation and expression
of α-actinin, cTnT and connexin 43 (Cx43; also known as Gja1), like
dsRed⁻ YFP⁺ cells (Supplementary Fig. 4b).

By electron microscopy, about half of the cells from
periostin-Cre:R26R-Tomato reprogrammed hearts exhibited well-organized
sarcomeres and mitochondria (Fig. 2p, q), although the sarcomeres
were consistently shorter than those of endogenous CMs and their Z-bands
more diffuse. Other Tomato⁺ cells displayed sarcomeric organization
in parts of the cell and variable mitochondrial organization
(Supplementary Fig. 8). For simplicity, we will refer to the β-galactosidase⁺
α-actinin⁺ CM-like cells as in vivo iCMs, based on morphology and
sarcomeric structure.

Finally, we assessed the reprogramming of gene expression in iCMs 
by qPCR, focusing on the messenger RNA levels of 20 genes normally 
enriched in mature CMs or cardiac fibroblasts. We tested iCMs iso- 
lated from multiple independent hearts alongside cardiac fibroblasts 
and endogenous CMs. mRNA levels in iCMs were similar to CMs 
(Fig. 2r and Supplementary Fig. 9), including the downregulation of 
periostin and Fsp1, consistent with the morphological changes 
described earlier. 


In vivo iCMs electrically mature and couple 


To determine whether iCMs expressed proteins involved in cell-cell 
communication similar to endogenous CMs, we examined the 
Figure 2 | Cellular analysis of the degree of in vivo cardiac reprogramming.
a, Immunofluorescent staining for YFP, dsRed and DAPI on heart sections
from tamoxifen pulse-labelled reprogrammed hearts injected with GMTR.
Scale bar, 50 µm. b, Immunofluorescent staining for YFP, dsRed and DAPI on
isolated CMs from the infarct/border zone of pulse-labelled reprogrammed
hearts. Scale bar, 100 µm. c, Quantification of the percentage of YFP⁺
cardiomyocytes in the infarct/border zone of pulse-labelled mouse hearts
injected with dsRed (control) or GMT compared to sham (**P < 0.01, n = 3).
d-f, Immunofluorescence staining for β-galactosidase (βGal) and DAPI on
isolated CMs from the infarct/border zone of periostin-Cre:R26R-lacZ hearts 4
weeks after dsRed (d) or GMT (e) injection, with quantification in f. n = 221
cells from three hearts for dsRed group; n = 182 cells from three hearts for
GMT group. Scale bar, 200 µm. g, Bright-field image of CMs isolated from
GMTR-injected periostin-Cre:R26R-lacZ hearts 4 weeks after MI.
h, j, k, Among these cells, a β-galactosidase-positive cell is shown (h) that also
co-stained with dsRed (j, k). i, DAPI labels nuclei. Scale bar, 50 µm.
l-o, Immunofluorescent staining for cardiac markers (including α-actinin,
tropomyosin, cardiac MHC and cTnT) co-labelled with β-galactosidase and
DAPI, in isolated CMs from the infarct/border zone of periostin-Cre:R26R-lacZ
hearts 4 weeks after GMT injection. The images display representative
reprogrammed CMs next to endogenous CMs from the same preparation.
Quantification of cells with full sarcomere development is shown, with the full
spectrum of marker expression, localization, and quantification shown in
Supplementary Fig. 8. White boxes in the merged pictures indicate the areas for
high magnification images shown in the far right panels. Scale bar, 50 µm for
the first four columns, 20 µm for the last column. p, q, Electron microscopy of
endogenous CMs or reprogrammed CMs, as identified by the
periostin-Cre:R26R-Tomato lineage marker (Tomato⁺ CM). Asterisk indicates
mitochondria and brackets indicate sarcomeric units. Scale bar, 2 µm. r, Heat
map of gene expression for a panel of CM- or fibroblast-enriched genes in
isolated adult cardiac fibroblasts (CFs), CMs or iCMs based on lineage markers.
The complete data set with statistics is provided in Supplementary Fig. 9. Error
bars indicate s.e.m.


expression pattern of N-cadherin, a cell-surface Ca²⁺-dependent
adhesion molecule normally found in intercalated disks within the
myocardium. We found that over 90% of iCMs expressed
N-cadherin, with 60% of cells localizing N-cadherin appropriately
at the cell border (Fig. 3a). Similarly, about 90% of iCMs expressed
Cx43, the major gap junction protein in the heart that promotes
electrical coupling and synchronized contraction of myocytes.
Half of the iCMs expressed Cx43 at high levels with good localization
relative to endogenous CMs (Fig. 3b), and in 4% of these cells, the
Cx43 localization pattern was almost indistinguishable from that of
endogenous CMs (Fig. 3b). Immunohistochemistry also revealed good
cell-border localization of Cx43 in iCMs (Fig. 3c).

To assess the function of cell-cell junctions, we imaged the transfer
of dyes (microinjected through patch pipettes) between cells, and
measured the intercellular transmission of excitation, via Ca²⁺ waves,
in small groups of cells isolated from the reprogrammed hearts.
Cascade Blue dextran (molecular weight (MW) 10,000), which is
too large to pass through gap junctions, was retained in the patched
iCMs (identified by the lineage marker periostin-Cre:R26R-Tomato).
In contrast, calcein (MW 600) diffused to interconnected endogenous
CMs (Fig. 3d). Ca²⁺ waves propagating in an iCM or CM excited
intracellular Ca²⁺ release in neighbouring cells (Fig. 3e), suggesting
that iCM-CM couplings form functional syncytia (see
Supplementary Movie 1).

We next performed recordings from a single-cell suspension of
CMs isolated from the border/infarct zone of periostin-Cre:R26R-EYFP
mice transduced with GMT, and subsequently compared the
action potentials generated by iCMs (YFP⁺) and endogenous CMs
(YFP⁻) using standard patch-clamp techniques. Approximately 50%
of patched iCMs had a physiological resting membrane potential
(−70 mV or less) and 50% of iCMs exhibited contractions in response
to electrical stimulation, similar to adult ventricular CMs, which are
normally quiescent without stimulation (Fig. 3f and
Supplementary Movie 2). Electrophysiology parameters assayed were similar to
those of endogenous ventricular CMs (Fig. 3g). In agreement, intracellular
calcium releases and cell shortening in iCMs were comparable to those of
endogenous CMs (Fig. 3h). The distribution of action potential
durations (APDs) was bimodal in iCMs and CMs, suggesting that
reprogrammed cells were incorporated near the epicardial (short APD90
(action potential duration at 90% of repolarization)) and endocardial
(long APD90) sides of ventricular tissue (Supplementary Fig. 10).


In vivo GMT improves cardiac function 


Because in vivo reprogrammed iCMs had contractile potential and 
electrically coupled with viable endogenous CMs (and other iCMs), 
we asked whether converting endogenous non-myocytes into new 
myocytes translates into partial restoration of heart function after 
MI. All studies were performed in a blinded fashion, including the 
retroviral injections, and were decoded only after completion of the 
measurements. By Evans blue/triphenyltetrazolium chloride (TTC) 
double staining, the area at risk (AAR) and the infarct size were 
similar in GMT- or dsRed-injected mice 48 h after coronary ligation 
(Supplementary Fig. 11a). Three months after MI, cardiac function 
was examined by magnetic resonance imaging (MRI). The fraction of 
blood ejected with each ventricular contraction (ejection fraction), the 
volume of blood ejected (stroke volume), and the total cardiac output 
per minute were significantly improved in GMT-infected mice, par- 
ticularly the stroke volume and cardiac output, possibly due to cardiac 
enlargement (Fig. 4a). To determine the time course of these improvements,
other mice underwent serial high-resolution two-dimensional
echocardiography 1 day before MI, and 3 days, 1, 4, 8 and 12 weeks
after MI (Supplementary Fig. 11b-d). All mice showed a comparable
Figure 3 | Electrophysiological properties of iCMs. a, b, Immunofluorescent
staining for N-cadherin (a) or Cx43 (b), co-labelled with β-galactosidase (βGal)
and DAPI in isolated CMs from periostin-Cre:R26R-lacZ hearts 4 weeks after
injury. Boxed areas are shown in higher magnification with the per cent of cells
having the indicated morphology. Green cells represent endogenous CMs, and
red/orange cells are iCMs. c, Immunohistochemistry for Cx43 on sections from
the infarct/border zone of periostin-Cre:R26R-lacZ hearts 4 weeks after GMT
injection. Scale bar, 50 µm in the first and third columns of a and b, and all of
c; 20 µm in the second and fourth columns of a and b. d, Representative images
of two CMs in contact with one another, including an iCM (red, cell 1) and an
endogenous CM (non-red, cell 2) loaded with large (dextran) or small (calcein)
dye. The large blue dextran dye loaded in the iCM (cell 1) by whole-cell
patch-clamp method did not travel to the CM (cell 2), but the smaller,
gap-junction-permeable dye calcein did cross the cell border (n = 5). Scale bar, 50 µm.
e, Video frames captured from a group of myocytes, including endogenous
CMs (non-red, cells 1 and 3) and an iCM (red, cell 2) imaged for Fluo-4
fluorescence transients corresponding to sarcoplasmic reticulum Ca²⁺ releases.
Video frames 100 ms apart show that the Ca²⁺ release has spread throughout
the myocyte group, including the iCM (n = 6). Scale bar, 50 µm. f, Intracellular
electrical recording of in vivo-derived YFP⁺ iCMs and endogenous YFP⁻ CMs
from the same preparation. Em, membrane potential in millivolts. g, Table of
action potential parameters measured for CMs and iCMs, including maximum
upstroke velocity (ΔV/Δtmax) and minimum diastolic potential (MDP)
measured immediately preceding stimulation, overshoot potential (OSP), and
the APDs at 90, 70 and 50% repolarization. h, Characteristic single-field-stimulated
intracellular calcium transients ([Ca²⁺]i) recorded from
endogenous (left) or induced (right) CMs. Bottom, the simultaneously
recorded per cent cell shortening responses triggered by the Ca²⁺ transients, in
the same two cells. Quantifications from 6 iCMs and 4 endogenous CMs are
shown in the right four panels. NS, not significant. For experiments performed
in d-h, cells were isolated from periostin-Cre:Rosa-YFP mice 8 weeks post-MI
and virus transduction. Error bars indicate s.e.m.


reduction in left ventricular function after coronary artery ligation 
(Supplementary Fig. 11c). Although different imaging approaches 
yield different absolute value norms, the overarching trends observed 
by echocardiography were similar to our MRI findings, in that func- 
tional improvements for all parameters were statistically significant 8 
and 12 weeks after injection (Supplementary Fig. 11c). 
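
The three functional readouts defined above (ejection fraction, stroke volume and cardiac output) are related by simple arithmetic on ventricular volumes. The sketch below uses the standard textbook definitions; the function names, units and all numerical values are hypothetical and are not taken from the MRI or echocardiography data in this study.

```python
def stroke_volume(edv_ul, esv_ul):
    """Volume ejected per beat: end-diastolic minus end-systolic volume (µl)."""
    return edv_ul - esv_ul


def ejection_fraction(edv_ul, esv_ul):
    """Percentage of the end-diastolic volume ejected with each contraction."""
    if edv_ul <= 0:
        raise ValueError("end-diastolic volume must be positive")
    return 100.0 * stroke_volume(edv_ul, esv_ul) / edv_ul


def cardiac_output(edv_ul, esv_ul, heart_rate_bpm):
    """Volume ejected per minute, in ml/min (stroke volume x heart rate)."""
    return stroke_volume(edv_ul, esv_ul) * heart_rate_bpm / 1000.0


# Hypothetical mouse left-ventricular volumes, for illustration only.
edv, esv, hr = 60.0, 30.0, 500.0
print(stroke_volume(edv, esv), ejection_fraction(edv, esv), cardiac_output(edv, esv, hr))
```

Because stroke volume depends on absolute chamber size whereas ejection fraction is a ratio, an enlarged ventricle can raise stroke volume and cardiac output more than ejection fraction, which is in keeping with the caveat about cardiac enlargement noted in the text.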

We next performed qPCR to monitor the expression levels of atrial 
natriuretic factor, brain natriuretic peptide and tenascin C in injured 
and control hearts. We found that MI led to the upregulation of all three
peptides, but this upregulation was attenuated in GMT-injected
infarcted hearts (Fig. 4b). Expression levels of collagen genes, which
were increased in dsRed-infected MI hearts, were also partially restored
by injecting GMT (Fig. 4c). Furthermore, the scar area calculated from
16 sections at four levels of the heart was significantly smaller 8 weeks
after MI in the GMT-treated group. To determine if the muscle cells in
the scar area were reprogrammed iCMs, we repeated the experiments in
periostin-Cre:R26R-LacZ transgenic mice. α-Actinin⁺ cells in the scar
area were also β-galactosidase positive, suggesting that they were newly
born iCMs of non-myocyte origin (Fig. 4e). Vascular density was
significantly increased in the border zone of reprogrammed hearts at 8
weeks (Supplementary Fig. 12). Electrocardiographic (ECG) studies 
(telemetry) over a 24-h period did not indicate evidence for more 
arrhythmias in GMT- versus dsRed-injected control mice, and no mice 
suffered sudden death (not shown). 


Thymosin β4 enhances effects of GMT in vivo


We hypothesized that infecting more Thy1⁺ cells would enhance
functional improvement. Thymosin β4, a 43-amino-acid G-actin
monomer-binding protein, promotes cell migration and cardiac cell
survival, and activates epicardial cells to become more proliferative
and yield more cardiac fibroblasts and endothelial cells. It also
improves cardiac function and decreases scar size after MI. To test
cardiac fibroblast migration, we used a cardiac explant migration
assay. The average time for fibroblasts to migrate from adult heart
explants was 3 weeks; however, thymosin β4 treatment led to
equivalent fibroblast migration within 2 weeks and within only 3 days in
Figure 4 | In vivo delivery of cardiac reprogramming factors improves
cardiac function after myocardial infarction. a, Ejection fraction (EF), stroke
volume (SV) and cardiac output (CO) of the left ventricle were quantified by
MRI 12 weeks after MI (n = 9 for each group, *P < 0.05). Left four panels show
representative transverse images of the thorax, containing hearts at the end of
diastole (relaxation) or systole (contraction) from dsRed- or GMT-injected
mice, compared to sham-operated age- and strain-matched controls. LV, left
ventricle; RV, right ventricle. b, qPCR of atrial natriuretic factor (ANF), brain
natriuretic peptide (BNP) and tenascin (TnC) on RNA extracted from the
border zone of hearts 4 weeks after MI and injection of dsRed or GMT. RQ,
relative quantification. c, qPCR of collagen type I alpha 1 (Col1a1), Col1a2,
Col3a1 and elastin (Eln) on RNA extracted from the border zone of hearts 4
weeks after MI and injection of dsRed or GMT. Data in b and c are relative to
dsRed-injected sham-operated mice, indicated by the dashed line. n = 3 for
each genotype with technical quadruplicates. *P < 0.05. d, Masson-Trichrome
staining on heart sections 8 weeks post-MI injected with dsRed or GMT with
quantification of scar size. Scale bars, 500 µm. dsRed, n = 8; GMT, n = 9;
*P < 0.05. e, Masson-Trichrome (left) and immunofluorescent staining for
α-actinin and/or β-galactosidase (βGal; right) in GMT-injected
periostin-Cre:R26R-lacZ mouse heart 4 weeks post-surgery. Scale bars, 500 µm in the left
panel, 50 µm in the right three panels. Error bars indicate s.e.m.


heart explants taken after MI (Supplementary Fig. 13a). Similarly,
vimentin⁺ cell proliferation was even more pronounced post-MI in
the presence of thymosin β4, as marked by phosphohistone H3
(Supplementary Fig. 13b). The percentage of Thy1⁺ (Supplementary
Fig. 13c) or vimentin⁺ (Supplementary Fig. 14a) cells infected by
retrovirus after MI doubled upon intramyocardial thymosin β4
injection (316,607 ± 17,799 dsRed⁺ cells averaged from three hearts).
GMT did not increase the number of cardiac fibroblasts
(Supplementary Fig. 13d). Delivering GMT-expressing retrovirus to more cells by
adding thymosin β4 yielded more β-galactosidase⁺ iCMs compared to
total CMs in single-cell CM culture from the infarct/border zone of
periostin-Cre:R26R-lacZ hearts (51% versus 35%) (Supplementary
Fig. 13e). However, we observed no change in the degree or efficiency
of in vivo reprogramming, with the percentage of iCMs generated from
the total GMT-infected cell population remaining at ~12%
(Supplementary Figs 13e and 14b).

Injecting thymosin β4 immediately after ligation improved cardiac
function, as previously reported. Co-injecting thymosin β4 and
GMT further improved ejection fraction and cardiac output 8 weeks
after infarction (Supplementary Figs 13f and 14d, e). Furthermore,
co-injecting thymosin β4 and GMT caused less scarring than injecting
either alone (Supplementary Fig. 13g), despite similar areas at risk and
initial infarct sizes (Supplementary Fig. 14c).


Discussion 


We show that upon cardiac injury, resident cardiac non-myocytes— 
primarily fibroblasts—can be converted into CM-like cells in vivo 
following local delivery of GMT by retroviral-mediated gene transfer. 
In vivo cardiac reprogramming occurred with similar initial efficiency 
as observed in vitro (10-15%). However, in vivo iCMs were more fully 
reprogrammed and more closely resembled endogenous CMs than 
their cultured counterparts. This may result from factors within the 
native microenvironment—including extracellular matrix, secreted 
proteins, and tissue stiffness—that further enhance reprogramming. 
Improved cardiac function may be explained by the diversion of a 
small percentage of fibroblasts into new CM-like cells, suggesting 
functional integration of these muscle cells. Although non-myocytes 
convert to iCMs to help regenerate the damaged heart, alteration of 
fibroblast behaviour by GMT may contribute to the effects on scar 
formation and cardiac function. Although it is difficult to separate the 
relative contributions of new muscle formation and other non-cell- 
autonomous effects, non-myocyte reprogramming in the heart 
appears to be beneficial for cardiac function. Optimizing gene delivery 
to more cells would probably enhance functional benefits. 

Improvement upon thymosin β4 addition is in agreement with the
notion that increasing the delivery of GMT to more cells could
enhance cardiac repair. Pre-treating hearts with thymosin β4 several
days before injury resulted in a small population of epicardial-derived
cells that could behave as myocyte precursors, but not if thymosin β4
was given at the time of injury. Nevertheless, transduction of GMT
into these progenitors, or other rare progenitors yet to be identified,
might promote their differentiation into cardiomyocytes. Because
thymosin β4 is also pro-angiogenic, the cooperativity between
GMT and thymosin β4 may be multifaceted and will be interesting
to explore.

The ability to regenerate adult heart tissue from endogenous cells is 
a promising approach to treating cardiac disease that may face fewer 
obstacles to clinical translation than other approaches. Improving 
the delivery of reprogramming factors, using small molecules and 
epigenetic modulators, and conducting trials in large animals will 
be important to refine the technology and assess its safety and efficacy, 
particularly regarding arrhythmias. 


METHODS SUMMARY 


Retroviruses. Retroviruses were generated as described using pMXs retroviral
vectors containing coding regions of Gata4, Mef2c, Tbx5 and dsRed. Ultra-high
titre virus (>1 × 10¹⁰ plaque-forming units (p.f.u.) per ml) was obtained by
ultracentrifugation.

Animals, surgery, echocardiography and electrocardiography.
Periostin-Cre:R26R-lacZ or Fsp1-Cre:R26R-lacZ mice were obtained by crossing
periostin-Cre mice or Fsp1-Cre mice and R26R-lacZ mice.
Periostin-Cre:R26R-EYFP or α-MHC-MerCreMer:R26R-EYFP mice were obtained by
crossing periostin-Cre or α-MHC-MerCreMer mice and R26R-EYFP mice.
Periostin-Cre:R26R-Tomato mice were obtained by crossing periostin-Cre mice
and R26R-Tomato mice. Surgeries and subsequent analyses were performed
blinded to genotype and intervention. MI was induced by permanent ligation
of the left anterior descending artery (LAD) as described. A pool of concentrated
virus (GMT or GMTR) was mixed, and 10 µl of mixed virus plus 10 µl of PBS or
40 ng µl⁻¹ thymosin β4 were injected along the boundary between the infarct and
border zones. Mouse echocardiography and surface electrocardiography were
performed as described. All mouse work was done with the approval of the
University of California, San Francisco (UCSF) animal care oversight committee.
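
For orientation, the injected quantities implied by the volumes and concentrations stated above can be worked out with a few lines of arithmetic. This is a rough sketch only: the titre is reported as a lower bound (greater than 1 × 10¹⁰ p.f.u. per ml), so the viral dose computed here is likewise a lower bound, and the constant and function names are hypothetical.

```python
# Values taken from the Methods Summary; names are illustrative only.
VIRUS_TITRE_PFU_PER_ML = 1e10      # stated lower bound on titre
VIRUS_VOLUME_UL = 10.0             # volume of mixed virus injected
THYMOSIN_CONC_NG_PER_UL = 40.0     # thymosin beta-4 stock concentration
THYMOSIN_VOLUME_UL = 10.0          # volume of thymosin beta-4 (or PBS) added


def injected_pfu(titre_pfu_per_ml, volume_ul):
    """Plaque-forming units delivered in a given volume (1 ml = 1000 µl)."""
    return titre_pfu_per_ml * volume_ul / 1000.0


def injected_mass_ng(conc_ng_per_ul, volume_ul):
    """Mass of peptide delivered in a given volume."""
    return conc_ng_per_ul * volume_ul


pfu = injected_pfu(VIRUS_TITRE_PFU_PER_ML, VIRUS_VOLUME_UL)
mass = injected_mass_ng(THYMOSIN_CONC_NG_PER_UL, THYMOSIN_VOLUME_UL)
total_volume_ul = VIRUS_VOLUME_UL + THYMOSIN_VOLUME_UL
final_conc = mass / total_volume_ul  # peptide concentration in the combined injectate

print(f"at least {pfu:.0e} p.f.u., {mass:.0f} ng peptide, "
      f"{final_conc:.0f} ng/µl in {total_volume_ul:.0f} µl")
```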
Immunohistochemistry, immunocytochemistry and electron microscopy.
Immunohistochemistry, immunocytochemistry and electron microscopy were
performed as described. Scar size was determined by Masson-Trichrome
staining. The AAR and myocardial infarct size were determined by Evans
blue/TTC labelling.

Cardiomyocyte isolation, patch-clamp, and fibroblast migration assays. Adult
cardiomyocytes were isolated as described with minor modifications. Single-cell
patch-clamp recordings were performed as described. Migration assays were
performed according to published protocols.

FACS and quantitative RT-PCR. Dissociated cardiac cells were stained with 
APC-conjugated anti-Thy1 antibody (eBioscience). Stained cells were sorted by 
FACSAria2 (BD) and RNA was extracted in TRIzol (Invitrogen). qPCR was 
performed using ABI 7900HT (TaqMan, Applied Biosystems). 

Statistics. Differences between groups were examined for statistical significance 
using unpaired Student's t-tests or ANOVA. P < 0.05 was regarded as significant. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


Retrovirus generation, concentration, and titration. Retroviruses were generated 
as described. To generate virus, pMXs retroviral vectors containing the coding 
regions of Gata4, Mef2c, Tbx5 and dsRed were transfected into Plat-E cells using 
Fugene 6 (Roche). Forty-eight hours after transfection, virus-containing superna- 
tants were collected and concentrated by standard ultracentrifugation. Retroviral 
titration was performed using the Retro-X qRT-PCR Titration Kit (Clontech), as 
per the manufacturer’s protocols. Ultra-high titre virus (>1 × 10¹⁰ plaque-forming 
units (p.f.u.) per ml) was resuspended in PBS. After verification of high transduction 
efficiency in cell culture (>95%), a large number of small stock aliquots (10 µl) were 
made and frozen at −80 °C to ensure consistency among experiments. After one 
freeze-thaw cycle, titrations were repeated to ensure that active virus was maintained 
at the desired concentration of 1 × 10¹⁰ p.f.u. per ml for in vivo injection. 

Mouse lines. Periostin (Postn)-Cre:R26R-lacZ mice were obtained by crossing 
Postn-Cre mice and Rosa26-lacZ mice. Postn-Cre:R26R-YFP mice were obtained 
by crossing Postn-Cre mice and Rosa26-EYFP mice, and Postn-Cre:R26R- 
Tomato mice were obtained by crossing Postn-Cre mice and Rosa26-Tomato 
mice. All transgenic lines for immunohistochemistry and single cell isolation 
were maintained by crossing with C57BL6 mice (Charles River). BALB/C mice 
(Charles River) were used for all functional studies after permanent ligation of 
the left anterior descending artery (LAD) and virus injection. Fsp1-Cre, 
Tie2-Cre and Myh6-MerCreMer mice were obtained from Jackson Labs, and 
lines were validated before further breeding. Fsp1-R26R, Tie2-R26R and 
Myh6-MerCreMer-YFP mice were obtained by crossing Fsp1-Cre, Tie2-Cre or 
Myh6-MerCreMer mice to R26R-lacZ or R26R-EYFP mice. Efficiency of Cre 
recombination induction for Myh6-MerCreMerYFP was tested by immunohisto- 
chemistry for YFP after injection of various doses of tamoxifen. To pulse label the 
pre-existing CMs, adult Myh6-MerCreMerYFP mice (8–12 weeks old) were 
treated with tamoxifen (Sigma) by intraperitoneal injection once a day for 5 days 
at a dosage of 20 mg kg⁻¹ day⁻¹. GMT delivery and coronary artery ligation were 
performed 2 days afterwards. 

Mouse MI model and in vivo delivery. The animal protocol for surgery was 
approved in accordance with institutional guidelines (UCSF Institutional Animal Care and Use 
Committee). All surgeries and subsequent analyses were performed blinded to 
genotype and intervention. Mice were anaesthetized with 2.4% isoflurane/97.6% 
oxygen and placed in a supine position on a heating pad (37 °C). Animals were 
intubated with a 19G stump needle and ventilated with room air using a 
MiniVent Type 845 mouse ventilator (Hugo Sachs Elektronik-Harvard 
Apparatus; stroke volume, 250 µl; respiratory rate, 120 breaths per minute). MI 
was induced by permanent ligation of the LAD with a 7-0 prolene suture as 
described. Sham-operated animals served as surgical controls and were 
subjected to the same procedures as the experimental animals with the exception that 
the LAD was not ligated. A pool of concentrated virus (GMT or GMTR) was 
mixed, and 10 µl of mixed virus plus 10 µl of PBS or 40 ng µl⁻¹ thymosin β4 was 
injected into the myocardium through an insulin syringe with an incorporated 
29 G needle (BD). Injection with a full dosage was carried out along the boundary 
between the infarct zone and border zone based on the blanched infarct area after 
coronary artery occlusion. After injection, the chest was closed with sutures and 
the mouse was allowed to recover with the mouse ventilator and heating pad. All 
surgical procedures were performed under aseptic conditions. At 2 days and 1, 2, 
4, 8 and 12 weeks after occlusion and viral delivery, the hearts were removed for 
perfusion fixation in 4% paraformaldehyde (PFA) for preparation of paraffin sections 
for structural analysis and immunohistochemistry or in 0.5% PFA in 5% sucrose 
followed by cryostat sectioning for immunofluorescent staining. Concurrently, 
heart tissues within the infarct zone, border zone and non-ischaemic zone distal 
to the infarct zone were dissected for RNA or protein isolation. 
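
As a rough check on the delivered dose, the per-injection quantities follow from the volumes and concentrations stated above. The sketch below is illustrative only: the function names are hypothetical, and the titre is taken at its stated lower bound of 1 × 10¹⁰ p.f.u. per ml.

```python
# Values taken from the text; names are hypothetical helpers for illustration.
TITRE_PFU_PER_ML = 1e10      # stated lower bound of the concentrated virus titre
VIRUS_VOLUME_UL = 10.0       # mixed virus per injection
THYMOSIN_NG_PER_UL = 40.0    # thymosin beta-4 concentration
SECOND_VOLUME_UL = 10.0      # PBS or thymosin beta-4 solution


def pfu_per_injection(titre_pfu_per_ml, volume_ul):
    """Plaque-forming units contained in `volume_ul` microlitres."""
    return titre_pfu_per_ml * volume_ul / 1000.0  # 1 ml = 1000 ul


def mass_ng(concentration_ng_per_ul, volume_ul):
    """Mass in nanograms contained in `volume_ul` microlitres."""
    return concentration_ng_per_ul * volume_ul


virus_dose = pfu_per_injection(TITRE_PFU_PER_ML, VIRUS_VOLUME_UL)
thymosin_dose = mass_ng(THYMOSIN_NG_PER_UL, SECOND_VOLUME_UL)
total_volume_ul = VIRUS_VOLUME_UL + SECOND_VOLUME_UL
```

Under these assumptions each injection carries about 1 × 10⁸ p.f.u. of virus and, where thymosin β4 is included, 400 ng of peptide in a total volume of 20 µl.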

Determination of the AAR and MI size. At 48 h after coronary ligation, the mice 
were anaesthetized and cannulated with tubing. Evans blue (2%, Sigma) was 
perfused into the aorta, thus all myocardial tissue was stained blue except the 
AAR. The left ventricle was isolated and cut into four ~1 mm pieces with the first 
cut at the ligation level. Left ventricle slices were stained in 1.5% TTC for 30 min at 
37 °C, and then fixed in 4% PFA overnight at 4°C. The area of infarction was 
demarcated as a white area, whereas viable myocardium was stained red. 
Photographs were taken from both sides of each section. The AAR and the infarct 
area were determined via planimetry with the computer software ImagePro 
(Biorad). Infarct size was calculated as the percentage of MI compared with the 
AAR using the described methodology. 
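
The ratio described above can be sketched as follows. This is an illustration, not the authors' ImagePro workflow: the dictionary keys are hypothetical, and averaging the two photographed faces of each slice before summing is an assumption, since the text does not say how the two sides were combined.

```python
def infarct_size_percent(slices):
    """Infarct area as a percentage of the area at risk (AAR).

    `slices` is a list of dicts holding planimetry areas (any consistent unit)
    for the two photographed faces, "a" and "b", of each left-ventricle slice.
    """
    infarct_total = 0.0
    aar_total = 0.0
    for s in slices:
        # Assumption: the two faces of a slice are averaged before summing.
        infarct_total += (s["infarct_a"] + s["infarct_b"]) / 2.0
        aar_total += (s["aar_a"] + s["aar_b"]) / 2.0
    if aar_total <= 0:
        raise ValueError("area at risk must be positive")
    return 100.0 * infarct_total / aar_total


# Hypothetical planimetry values for the four slices of one heart.
example_heart = [
    {"infarct_a": 2.0, "infarct_b": 2.0, "aar_a": 4.0, "aar_b": 4.0},
    {"infarct_a": 3.0, "infarct_b": 1.0, "aar_a": 5.0, "aar_b": 3.0},
    {"infarct_a": 1.0, "infarct_b": 1.0, "aar_a": 2.0, "aar_b": 2.0},
    {"infarct_a": 1.0, "infarct_b": 1.0, "aar_a": 2.0, "aar_b": 2.0},
]
example_percent = infarct_size_percent(example_heart)
```

Summing areas across slices before dividing keeps large slices from being under-weighted, which a mean of per-slice ratios would do.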

Determination of scar size. Standard Masson-Trichrome staining was per- 
formed on hearts 8 weeks post-viral delivery and coronary artery ligation. To 
determine the scar size, we used ImagePro software to measure the scar area 
(blue) and healthy area (red) on transverse sections spanning four levels 
(50 mm between two levels, with the first level starting right below the ligation) 
within the left ventricle of a MI heart. From each level, we measured four slices of 
tissues as technical quadruplicates (for a total of 16 sections). The averaged 
number was used for statistics and comparison. The measurements and calcula- 
tions were conducted in a blinded manner. 
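
One way to express the averaging over 16 sections is sketched below. The text reports only that scar (blue) and healthy (red) areas were measured and averaged; defining scar size as scar area divided by scar plus healthy area is an assumption, and the function names are hypothetical.

```python
from statistics import mean


def scar_fraction(scar_area, healthy_area):
    """Scar area as a fraction of the total measured area of one section."""
    return scar_area / (scar_area + healthy_area)


def heart_scar_percent(levels):
    """Average scar percentage for one heart.

    `levels` holds four levels, each a list of four (scar_area, healthy_area)
    tuples, i.e. the technical quadruplicates described in the text.
    """
    fractions = [
        scar_fraction(scar, healthy)
        for level in levels
        for scar, healthy in level
    ]
    return 100.0 * mean(fractions)


# Hypothetical measurements: every section has 1 unit of scar and 3 of healthy tissue.
example_levels = [[(1.0, 3.0)] * 4 for _ in range(4)]
example_scar_percent = heart_scar_percent(example_levels)
```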

Mouse echocardiography. Echocardiography was performed by the Vevo 770 
High-Resolution Micro-Imaging System (VisualSonics) with a 15-MHz linear 
array ultrasound transducer. The left ventricle was assessed in both parasternal 
long-axis and short-axis views at a frame rate of 120 Hz. End-systole or end- 
diastole was defined as the phase in which the smallest or largest area of the left 
ventricle, respectively, was obtained and used for ejection fraction measurement. 
Left ventricular end-systolic diameter and left ventricular end-diastolic diameter 
were measured from the left ventricular M-mode tracing with a sweep speed of 
50 mm s⁻¹ at the papillary muscle level for calculating the shortening fraction. 
B-mode was used for two-dimensional measurements of end-systolic and end- 
diastolic dimensions. 
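
The two functional indices named above have standard definitions, sketched here for reference. The function names are hypothetical, and the conversion from measured left ventricular areas to volumes is left to the imaging software, because the text does not specify the volume model used.

```python
def shortening_fraction(lvedd_mm, lvesd_mm):
    """Fractional shortening (%) from M-mode end-diastolic and end-systolic diameters."""
    if lvedd_mm <= 0:
        raise ValueError("end-diastolic diameter must be positive")
    return 100.0 * (lvedd_mm - lvesd_mm) / lvedd_mm


def ejection_fraction(edv, esv):
    """Ejection fraction (%) from end-diastolic and end-systolic volumes (same unit)."""
    if edv <= 0:
        raise ValueError("end-diastolic volume must be positive")
    return 100.0 * (edv - esv) / edv


# Hypothetical measurements for one mouse.
fs_example = shortening_fraction(lvedd_mm=4.0, lvesd_mm=3.0)
ef_example = ejection_fraction(edv=60.0, esv=30.0)
```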

Mouse surface electrocardiography. Mice were anaesthetized with 1.75% iso- 
flurane at a core temperature of 37-38°C. Four needle electrodes (AD 
Instruments) were placed subcutaneously in standard limb lead configurations. 
For each mouse, 10-20 s of continuous signals were sampled at 10 kHz in each 
lead configuration with a PowerLab4/30 interface (AD Instruments). Data ana- 
lysis was performed offline with electronic calipers on averaged beats (Chart5Pro 
v5.4.2, AD Instruments). 

Mouse awake electrocardiography. To record awake electrocardiograms in six 
reprogrammed post-MI mice and six control post-MI mice, transmitters were 
surgically implanted according to the manufacturer’s instructions (Data Sciences 
International). After a 3-day recovery period, the electrocardiogram was recorded 
continuously for 48 h in each mouse. Tracings were analysed off-line and were 
scored by a blinded investigator for the presence and frequency of arrhythmias. 

MRI. MRI was performed on a Varian DirectDrive 7T small-animal scanner. 
Each mouse was anaesthetized by inhalation of 2% isoflurane/98% oxygen admi- 
nistered via an MR-compatible mobile inhalation anaesthesia system (Vet 
Equip). The mice were put in supine position on a homemade heating bed to 
keep the temperature at 37 °C. Two ECG leads were inserted into the right front 
and left rear leg. ECG waveforms were monitored with a small animal monitoring 
and gating system (SA Instruments). The mouse was then placed into a 
homemade ¹H birdcage coil with an inner diameter of 32 mm. A group of spin echo 
scout images, ECG-triggered on the R-wave rising edge, was acquired first to define 
the oblique plane of the short axis. Then an ECG-triggered two-dimensional 
gradient echo sequence with an echo time of 2.75 ms, repetition time of 200 ms 
and a flip angle of 45° was used to obtain nine short-axis images at 12 or 13 phases 
per cardiac cycle. Each scan consisted of 8–9 contiguous slices spanning the left 
ventricle from apex to base with 1-mm thickness, a matrix size of 128 × 128, a 
field of view of 25.6 × 25.6 mm, and four averages. 

Isolation of adult CMs. Adult CM isolation was performed as described with 
minor modifications. Briefly, adult mice were anaesthetized with isoflurane and 
mechanically ventilated. Hearts were removed and perfused retrogradely via 
aortic cannulation with a constant flow of 3 ml min⁻¹ in a Langendorff apparatus. 
Hearts were perfused at 37 °C for 5 min with supplemented Wittenberg Isolation 
Medium (WIM) containing (in mM): 116 NaCl, 5.4 KCl, 6.7 MgCl₂, 12 glucose, 2 
glutamine, 3.5 NaHCO₃, 1.5 KH₂PO₄, 1.0 NaH₂PO₄, 21 HEPES, with 1.5 nM 
insulin, essential vitamins (GIBCO), and essential amino acids (GIBCO) (pH 
7.4), followed by digestion solution (WIM, supplemented with 0.8 mg ml⁻¹ 
collagenase II and 10 µM CaCl₂) for 10 min (4 min for paired CMs that were used in 
cell-cell coupling experiments). Hearts were then removed from the Langendorff 
apparatus while intact (with tissues loosely connected). Desired areas (that is, 
border/infarct zone) were then micro-dissected under the microscope, followed 
by mechanical dissociation, triturating, and resuspension in a low-calcium 
solution (WIM, supplemented with 5 mg ml⁻¹ BSA, 10 mM taurine, and 150 µM 
CaCl₂). Cells were then spun at low speed, supernatant was removed, and calcium 
was gradually reintroduced through a series of washes. For electrophysiology 
experiments, cells were used on the same day as isolation and, until recording, 
were stored at room temperature (21 °C) in M199 (Gibco) supplemented with 
5 mM creatine, 2 mM L-carnitine, 5 mM taurine and 1.5 nM insulin. For 
immunohistochemistry, cells were plated onto laminin-coated culture slides, 
allowed to adhere, and fixed on the day of isolation. For electron microscopy 
or qPCR, iCMs were selected manually by micro-pipette based on the presence of 
periostin-Cre:R26R-YFP/Tomato signal under the fluorescent microscope right 
after isolation. 

Cardiac fibroblast migration assay. The migration assay was performed 
according to the explant culture protocol as described. In brief, isolated adult mouse 
hearts were minced into small pieces less than 1 mm³ in size. The explants were 
plated on gelatin-coated dishes and cultured in explant medium (IMDM, 20% 
FBS) until fibroblasts migrated out from minced tissue. The time required for ten 
heart pieces to have migratory fibroblasts surrounding them was recorded. 

FACS analyses and sorting. At 48 h after LAD ligation and viral introduction, hearts were 
removed and minced into small pieces less than 1 mm³ in size. Blood cells and 
debris were removed by several washes with PBS. Minced cardiac tissues were 
digested in an Eppendorf tube and shaken with glass beads in enzyme buffer 
(collagenase/dispase plus DNase; Roche) at 37 °C. After passing through a 
40-µm cell strainer, dissociated cardiac cells were stained with APC-conjugated 
anti-Thy1 antibody (eBioscience) for 30 min at room temperature. After washing with 
PBS twice, stained cells were sorted by FACSAria2 (BD). 

Immunohistochemistry. For immunofluorescence, after perfusion-fixed hearts 
were taken out and fixed in 0.5% PFA overnight, ventricles below the ligation 
were embedded in OCT compound and frozen in liquid nitrogen. Sections were 
blocked in Universal Blocking Buffer (BioGenex) for 10 min, and then stained 
with primary antibodies against α-actinin (Sigma Aldrich), vimentin (Progen), 
β-galactosidase (Abcam), pH3 (Millipore), RFP (Biovision), Thy1 (BD), CD34 
(Abcam), WT1 (Abcam), PECAM (BD), NG2 (Millipore) and GFP (Invitrogen) 
for 1 h at room temperature. After washing three times with PBST (PBS plus 0.1% 
Triton), sections were incubated in secondary antibodies for 1h followed by 
washing an additional three times with PBST. Finally, the sections were mounted 
in Vectashield with DAPI (Vector Laboratories). 

Immunocytochemistry. Isolated cells plated on chamber slides were fixed in 4% 
PFA at 4 °C overnight and washed with PBS twice. Cells were then incubated with 
primary antibodies against sarcomeric α-actinin (Sigma Aldrich), cTnT (Thermo 
Scientific), RFP (Biovision), tropomyosin (Hybridoma Bank), MF20 (Hybridoma 
Bank), Cx43 (Sigma Aldrich), N-cadherin (Invitrogen), β-galactosidase (Abcam), 
smooth muscle actin (Sigma Aldrich) and vimentin (Progen) for 1 h at room 
temperature and washed with PBS three times, then incubated with secondary 
IgG antibodies conjugated to Cy 488 or 594 (Jackson Immunoresearch) for 
30 min. After washing with PBS, cells were mounted in Vectashield with DAPI. 

Electron microscopy. For electron microscopy, cells were fixed in 2% 
glutaraldehyde, 1% paraformaldehyde in 0.1 M sodium cacodylate buffer, pH 7.4, 
post-fixed in 2% osmium tetroxide in the same buffer, en bloc stained with 2% aqueous 
uranyl acetate, dehydrated in acetone, infiltrated, and embedded in LX-112 resin 
(Ladd Research Industries). Samples were ultrathin sectioned on a Reichert 
Ultracut S ultramicrotome and counter stained with 0.8% lead citrate. Grids were 
examined on a JEOL JEM-1230 transmission electron microscope (JEOL USA) 
and photographed with the Gatan Ultrascan 1000 digital camera (Gatan). 

Action potential recordings. Isolated myocytes suspended in tissue culture 
medium were transferred to a superfusion chamber (RC-26GLP; Warner 
Instruments) on the stage of a Nikon TiS inverted fluorescence microscope 
equipped with a dual wavelength microfluorometer (IonOptix). Myocytes iso- 
lated from either periostin-Cre:R26R-YFP or periostin-Cre:R26R-Tomato hearts 
after GMT infection were identified as control CMs or iCMs on the basis of 
Tomato fluorescence (Texas red optics with exciter 560 ± 55 nm, emitter 
645 ± 75 nm) or YFP (standard FITC optics), and were chosen for study if they 
lacked spontaneous beating and partial contractures. The myocytes were whole- 
cell patch-clamped using an Axopatch 200B amplifier and pClamp software 
(Molecular Devices). Patch electrodes of 2–5 MΩ (1B-150F; WPI) were filled 
with intracellular solution containing 120 mM KCl, 20 mM NaHEPES, 10 mM 
MgATP, 5 mM K3EGTA (or 0.1 mM), 2 mM MgCl₂, and adjusted to pH 7.1 with 
KOH. The cells were superfused with a modified Tyrode’s extracellular solution 
containing 137 mM NaCl, 10 mM NaHEPES, 10 mM dextrose, 5 mM KCl, 2 mM 
CaCl₂, 1 mM MgCl₂, adjusted to pH 7.4 with NaOH. After GΩ seal formation, 
whole-cell access to the myocyte was established by applying brief pressure pulses, 
and the amplifier was switched to current clamp mode, whereupon the cell’s 
resting potential developed. Action potentials were stimulated at 0.33 Hz using 
2nA, 2ms current pulses applied through the patch pipette, and were signal 
averaged in tens. All membrane potentials were corrected for a −5.6 mV liquid 
junction potential determined via pClamp software. Finally, the amplifier was 
switched back to voltage clamp mode to identify individual ion channel currents 
as required. In some experiments, the concentration of K3EGTA in the 
intracellular solution was reduced to 0.1 mM to permit excitation-contraction coupling 
to occur, and 100 µM K5Fluo-4 (Invitrogen Corp) was added to define the 
presence of cytosolic Ca²⁺ transients during action potentials. Electrophysiology data 
were digitized at 5 kHz and low-pass filtered at 2 kHz. Analysis was performed 
using pClamp, Microsoft Excel, and Origin (OriginLab) software. Action 
potential duration was measured from the point of maximum depolarizing voltage 
change (ΔV/Δt_max) to 50, 70 and 90% repolarization. Unless stated otherwise, experiments 
were performed at room temperature. 
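
A minimal sketch of how the three durations could be read off a signal-averaged trace is given below. It assumes that the trace starts at the resting potential and that repolarization is expressed as a percentage of the amplitude from rest to peak. The function name and data layout are hypothetical; the authors' own analysis used pClamp, Excel and Origin.

```python
def action_potential_durations(t_ms, v_mv, levels=(50, 70, 90)):
    """Return {level: duration in ms} for one averaged action potential.

    t_ms and v_mv are equal-length sequences of time (ms) and membrane
    potential (mV). Durations run from the sample with the steepest upstroke
    to the first sample at or below each repolarization level.
    """
    # Locate the maximum rate of depolarization (finite-difference slope).
    slopes = [
        (v_mv[i + 1] - v_mv[i]) / (t_ms[i + 1] - t_ms[i])
        for i in range(len(v_mv) - 1)
    ]
    i_up = max(range(len(slopes)), key=slopes.__getitem__)
    t_up = t_ms[i_up]

    v_rest = v_mv[0]  # assumption: the first sample is at rest
    i_peak = max(range(len(v_mv)), key=v_mv.__getitem__)
    amplitude = v_mv[i_peak] - v_rest

    durations = {}
    for level in levels:
        threshold = v_mv[i_peak] - amplitude * level / 100.0
        durations[level] = None  # stays None if the level is never reached
        for i in range(i_peak, len(v_mv)):
            if v_mv[i] <= threshold:
                durations[level] = t_ms[i] - t_up
                break
    return durations


# Synthetic trace: rest at -80 mV, step to +20 mV, then 1 mV/ms repolarization.
t_example = list(range(103))
v_example = [-80, -80] + [20 - k for k in range(101)]
apd_example = action_potential_durations(t_example, v_example)
```

Real recordings would normally be interpolated between samples; the sample-resolution search here is kept deliberately simple.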


Field stimulation experiments. Isolated myocytes were loaded with Fluo-4 for 
30 min at room temperature before being transferred to the superfusion chamber. 
The loading solution contained a 1:10 mixture of 5 mM Fluo-4 AM in dry DMSO 
and PowerLoad™ concentrate (Invitrogen) which was diluted 100-fold into 
extracellular Tyrode’s solution containing suspended myocytes. An additional 
20 min was allowed for de-esterification before commencing recordings. 
Contractions and Ca²⁺ transients were evoked by applying voltage pulses at 
0.33 Hz, between platinum wires placed on either side of the cell of interest and 
connected to a field stimulator (IonOptix, Myopacer). The pulses were of 2 ms 
duration and set at 150% of the threshold required to elicit twitches. Fluo-4 
fluorescence transients were recorded via a standard filter set (#49011 ET, 
Chroma Technology) in batches of ten to enable signal averaging. Between stimuli, 
the fluorescence excitation light was blocked by an electromechanical shutter 
(CS35; Vincent Associates). Resting fluorescence was recorded after cessation of 
pacing, and background light was obtained after picking up and removing the cell 
from the field of view with a patch electrode at the end of the experiment. The Ca²⁺ 
transients were calibrated using the pseudo-ratio method, assuming an in situ 
dissociation constant of 1.1 µM for Fluo-4. Contractions were optically recorded 
simultaneously with Ca²⁺ transients by illuminating the cell of interest in red light 
(λ > 665 nm) subsequently directed to a CCD camera (IonOptix Myocam). The 
cell length signals were converted to voltage via a video motion detector (VED 205; 
Crescent Electronics) and contraction amplitudes from different myocytes were 
normalized by calculating the per cent change in cell length. 
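
The calibration and normalization steps can be sketched as follows, using the commonly cited pseudo-ratio expression [Ca²⁺] = K_d·R / (K_d/[Ca²⁺]_rest + 1 − R), where R is the background-subtracted fluorescence divided by the background-subtracted resting fluorescence. The resting Ca²⁺ concentration is not given in the text, so the 0.1 µM used in the example is purely an assumption, and the function names are hypothetical.

```python
KD_FLUO4_UM = 1.1  # in situ dissociation constant stated in the text (µM)


def pseudo_ratio_calcium(f, f_rest, f_background, ca_rest_um, kd_um=KD_FLUO4_UM):
    """Convert a Fluo-4 fluorescence value to [Ca2+] in µM by the pseudo-ratio method."""
    r = (f - f_background) / (f_rest - f_background)
    return kd_um * r / (kd_um / ca_rest_um + 1.0 - r)


def percent_shortening(resting_length, shortest_length):
    """Contraction amplitude as the per cent change in cell length."""
    return 100.0 * (resting_length - shortest_length) / resting_length


# Hypothetical values: background 50 a.u., resting 150 a.u., peak 250 a.u.,
# with an ASSUMED resting [Ca2+] of 0.1 µM.
ca_rest = pseudo_ratio_calcium(150.0, 150.0, 50.0, ca_rest_um=0.1)
ca_peak = pseudo_ratio_calcium(250.0, 150.0, 50.0, ca_rest_um=0.1)
shortening = percent_shortening(100.0, 90.0)
```

By construction the expression returns the assumed resting concentration when R = 1, which makes a convenient sanity check on any implementation.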

Determination of cell-cell coupling. In experiments to assess the interconnec- 
tivity between iCMs and CMs, the whole-cell patch-clamp method was used to 
introduce a gap-junction-permeable (that is, mobile) and an impermeable (that 
is, immobile) dye into the same cell of interest within a small group of apparently 
coupled isolated myocytes (n = 5 groups from 5 independent hearts). The mobile 
dye was calcein (5 mM), and the immobile dye was 1 mM dextran-conjugated 
Cascade Blue (MW 10,000). The immobile dye was chosen to be well separated 
spectrally, both from calcein and from Tomato, which was used for labelling iCMs 
(Invitrogen). The dye pair was included in standard intracellular solution (con- 
taining 5 mM EGTA) and cytoplasmic loading was allowed to proceed for 2 min, 
after which the patch electrode was withdrawn from the patched myocyte. The 
sarcolemma of the cell resealed after pulling off the pipette, aided by high EGTA 
levels in the filling solution, thereby trapping the dyes in the cytoplasm. Blue 
fluorescence from the immobile indicator was excited at 365 ± 40 nm, whereas 
calcein fluorescence was excited at 470 ± 40 nm. Fluorescent images were 
recorded using IonOptix Myocam via a video frame grabber (#166VCB, 
Hauppauge) for processing using ImageJ software. 

The functionality of cell-cell junctions was investigated by imaging the 
intercellular transmission of Ca²⁺ waves and excitation between myocytes pre-loaded 
with Fluo-4 AM as detailed above for the field-stimulation studies. Superfusion 
with 2 µM ouabain for 5–10 min was used to induce intracellular Ca²⁺ overload 
accompanied by Ca²⁺ wave activity, and videomicroscopy revealed the 
spatiotemporal relationships of Ca²⁺ waves translocating within and between the 
individual cells imaged in small, adherent groups (n = 6 groups from 5 independent 
hearts). 

Quantitative RT-PCR. iCMs were manually sorted based on the presence of 
fluorescent lineage markers. Approximately 100 iCMs were pooled for RNA 
isolation. Similarly, ~100 endogenous CMs isolated using the standard 
Langendorff apparatus (see above) and ~500 cardiac fibroblasts using the 
migration assay (see above) were prepared for RNA isolation. RNA was extracted by the 
TRIzol method (Invitrogen). RT-PCR was performed using the Superscript III 
first-strand synthesis system (Invitrogen). qPCR was performed using the ABI 
7900HT (TaqMan, Applied Biosystems) as per the manufacturer’s protocols. 
Optimized primers from the TaqMan Gene Expression Array were used. 

Statistical analyses. Differences between groups were examined for statistical 
significance using unpaired Student’s t-test or ANOVA. A P-value < 0.05 was 
regarded as significant. Error bars indicate standard error of the mean (s.e.m.). 
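
For reference, the two quantities named above can be computed as in the sketch below. It implements the textbook pooled-variance (equal-variance) form of the unpaired Student's t-test and returns the t statistic and degrees of freedom. Converting these to a P value requires the t distribution from a statistics package, which is omitted here. The function names are hypothetical and the data are invented.

```python
from math import sqrt
from statistics import mean, stdev, variance


def sem(values):
    """Standard error of the mean (sample standard deviation / sqrt(n))."""
    return stdev(values) / sqrt(len(values))


def unpaired_t(a, b):
    """Pooled-variance Student's t statistic and degrees of freedom for two groups."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled_var * (1.0 / na + 1.0 / nb))
    return t, df


# Invented example data for two groups of three measurements each.
group_a = [1.0, 2.0, 3.0]
group_b = [4.0, 5.0, 6.0]
t_stat, dof = unpaired_t(group_a, group_b)
sem_a = sem(group_a)
```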